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1.  INTRODUCTION 

Lung  cancer  is  the  leading  cause  of  death  from  cancer  in  the  US  and  the  world,  accounting  for  28%  of  all 
cancer  deaths  in  men  and  women  [2].  Lung-cancer  associated  mortality  has  remained  essentially  unchanged 
over  the  last  3  decades,  in  part  because  the  majority  of  lung  cancers  present  at  an  advanced  stage.  Since 
1972,  when  Congress  declared  war  on  cancer,  lung  cancer's  5  year  survival  rate  has  remain  unchanged  at 
15%  while  the  5  year  survival  rates  for  breast,  prostate  and  colon  cancers  have  risen  to  88%,  99%  and  65% 
respectively  [2],  The  major  cause  of  lung  cancer  is  smoking,  yet  only  10-15%  of  smokers  develop  lung  cancer 
[3].  Tobacco  addiction  and  exposure  to  other  lung  cancer  carcinogens  are  serious  problems  among  military 
personnel  and  war  veterans.  In  2005,  the  DOD  reported  that  over  32%  of  military  personnel  smoke.  Although 
this  is  a  substantial  decrease  when  compared  to  a  survey  done  in  1980,  the  smoking  rate  among  the  general 
population  in  2005  was  only  21%  according  to  a  report  by  the  Centers  for  Disease  Control  and  Prevention. 
Smoking  among  military  personnel  has  increased  since  the  late  1990s  in  association  with  conflicts  in 
Afghanistan  and  Iraq  and  is  50%  higher  in  deployed  vs.  non-deployed  personnel  with  smoking  rates  in  20-25 
year  olds  of  37%  vs.  20%  in  the  civilian  population.  Even  conservative  estimates  place  the  cost  of  lung  cancer 
to  the  military  at  $1  billion  a  year  and  this  cost  will  only  increase  with  this  wave  of  new  smokers.  The  optimal 
treatment  for  non-small-cell  lung  cancer  (NSCLC)  is  surgical  resection;  however,  75%  of  patients  are  ineligible 
because  of  advanced  disease. 

The  results  of  National  Lung  Screening  Trial  (NLST)  that  utilized  low  dose  helical  computed  tomography 
(LDCT)  screening  in  a  population  of  heavy  current  or  former  smokers  were  recently  published  and 
demonstrated  a  20%  reduction  in  lung  cancer  mortality  and  a  6.7%  decrease  in  all-cause  mortality  with  LDCT 
relative  to  CXR  [4,  5].  This  is  the  single  most  significant  advance  in  reducing  lung  cancer  mortality  since  1964, 
when  the  United  States  Surgeon  General  first  publicized  the  causal  relationship  between  lung  cancer  and 
cigarette  smoking,  prompting  a  decline  in  active  smoking  in  the  US.  However,  with  the  reported  96.4%  false 
positives  rate  there  is  a  pressing  need  to  develop  reliable  molecular  biomarkers  to  supplement  the  radiologic 
imaging  to  improve  the  sensitivity  of  early  detection  of  lung  cancer.  Based  on  the  notions  that:  1 )  smoking- 
induced  injury  alters  mRNA  and  microRNA  (miRNA)  expression  profiles  in  airway  epithelium  [1,  6,  7],  and  2) 
these  changes  can  be  detected  and  serve  as  biomarker  for  early  detection  of  lung  cancer  [8,  9],  in  the  current 
project  we  have  focused  on  investigation  of  the  molecular  events  associated  with  field  cancerization  to  identify 
biomarkers  of  early  lung  carcinogenesis. 

In  Specific  Aim  1  of  this  program,  high-throughput  microarray  mRNA  expression  analyses  have  been 
performed  on  cytological  specimens  (brushings)  obtained  at  intraoperative  bronchoscopy  from  the  main  carina 
and  main  ipsilateral  bronchus,  as  well  as  on  specimens  obtained  at  lobectomy  procedures  from  the  main  lobe 
bronchus  (adjacent  to  SCCs),  sub-segmental  bronchus  (adjacent  to  adenocarcinomas)  and  from  the  resected 
NSCLC  tumors.  Towards  this  aim,  we  compared  and  contrasted  global  gene  expression  patterns  across  all  the 
specimens  from  the  entire  field  and  corresponding  NSCLC  tumors.  We  developed  lung  adenocarcinoma  and 
see  field  cancerization  signatures  signifying  the  differential  mRNA  expression  patterns  between  the  carina 
and  the  subsegmental  bronchus  and  main  lobe  bronchus,  respectively.  In  addition,  similar  expression  profiles 
between  the  carina  and  resected  NSCLC  tumors  have  been  integrated  with  available  gene  expression  data  of 
bronchial  brushings  from  the  main  carina  isolated  at  various  time  points  post-surgery  from  40  NSCLC  patients; 
Department  of  Defense  (DoD)  VITAL  patients.  Promising  markers  derived  from  this  study  are  being  validated 
at  the  mRNA  and  protein  level  in  histological  tissue  specimens.  Moreover,  we  performed  RNA-sequencing  and 
microarray  profiling  of  nasal  epithelia,  airway  epithelial  cells  collected  from  both  bronchoscopy  and  lobectomy 
specimens  as  well  as  of  corresponding  tumors  (NSCLC  patients)  or  benign  lesions  (cancer-free  individuals). 

In  Specific  Aim  2,  we  utilized  laser  capture  microdissection  to  obtain  specific  cell  populations  (basal  cells  or 
type  II  alveolar  cells,  depending  on  the  NSCLC  histology/location)  as  well  as  premalignant  lesions  and 
epithelial  components  of  the  tumors.  These  cell  populations  have  been  profiled  with  RNA-seq  to  determine 
their  gene  expression  signatures  to  increase  our  understanding  of  premalignancy.  We  identified  the  gene 
expression  profiles  that  are  associated  with  progression  from  a  benign  cell  population  to  premalignancy  and 
with  progression  from  a  benign  cell  population  to  true  malignancy.  Additionally  we  performed  whole  exome 
DNA  sequencing  of  matched  sets  of  premalignant  lesions,  adenocarcinoma  in  situ,  lung  cancer  and  adjacent 
normal  tissues  isolated  from  the  same  patients  to  determine  the  role  of  somatic  mutations  in  lung  cancer 
initiation  and  progression. 
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In  Specific  Aim  3,  we  utilized  expression  signatures  and  biomarkers  derived  from  the  results  of  aims  1  and  2  to 
develop  and  test  airway-based  biomarkers  capable  of  diagnosing  lung  cancer  in  current  or  former  smokers 
using  minimally  invasive  sites. 

2.  KEYWORDS 

Lung  cancer,  biomarkers,  miRNA  profiling,  protein  profiling,  expression  analysis,  whole  exome  sequencing, 
early  detection. 

3.  OVERALL  PROJECT  SUMMARY 


Specific  Aim  1:  To  increase  our  understanding  of  the  molecuiar  basis  of  the  pathogenesis  of  lung 
cancer  in  the  “field  cancerization”  that  develops  in  current  and  former  smokers. 


Summary  of  Research  Findings 


A.  To  increase  our  understanding  of  the  molecular  basis  of  the  pathogenesis  of  lung  cancer  in  the 
“field  cancerization”  that  develops  in  current  and  former  smokers. 


Collection  of  airway  epithelial  samples  from  both 
bronchoscopy  and  lobectomy  specimens  from 
smokers  with  and  without  lung  cancer. 

Over  the  course  of  this  grant,  we  were  able  to  recruit 
37  subjects  undergoing  resection  of  lung  tumor  or 
benign  lung  lesions  across  four  institutions 
(MDAnderson,  Boston  University,  University  of 
California  Los  Angeles,  and  Vanderbilt  University)  for 
the  purposes  of  studies  in  Aim  1  (Table  1).  We 
collected  biospecimens  from  seven  locations:  nasal 


Cancer  (n  =Z8) 

No  Cancer  (n=9] 

p  value 

Age 

65  (11) 

52  (16) 

0,05 

Gender 

10  M,  18  F 

2  M,  7  F 

0.69 

Smoking  Status 

5  C,  21  Fr,  2  N 

2  C,  4  Fr,  3  N 

0.11 

Cancer  type 

16  ADC,  12  SCC 

N/A 

N/A 

Table  1.  Subject  Demographics.  Age  is  listed  as  the 
average  (standard  deviation).  C,  current;  Fr,  former;  N, 
never.  ADC,  adenocarcinoma;  SCC,  squamous  cell 
carcinoma. 


epithelium,  proximal  and  distal  bronchial  airway  epithelium  obtained  at  bronchoscopy  (ipsilateral  and 
contralateral  to  the  tumor)  as  well  as  the  tumor/benign  lesion,  adjacent  normal  parenchyma,  and  sub- 
segmental  bronchial  epithelium  at  time  of  lobectomy  (Figure  1). 


High  throughput  gene  expression  profiling  (Aim1A). 

Total  RNA  was  isolated  and  profiled  by  microarray  (Affymetrix  GeneChip  Human  Gene  2.0  ST).  Microarrays 
were  normalized  and  outliers  were  removed  based  on  the  Relative  Log  Expression  (RLE),  Normalized 
Unsealed  Standard  Error  (NUSE)  quality  metrics  and  Principal  Component  Analysis  (PCA).  Using  linear 
modeling  we  compared  patients  with  cancer  to  those  with  benign  lung  disease  to  identify  genes  with  cancer- 
associated  gene-expression  across  sites.  We  identified  two  patterns:  1)  cancer-associated  expression 
differences  that  changed  with  distance  from  tumors  (field  of  cancerization,  47  genes),  and  2)  genes 
consistently  associated  with  cancer  across  the  sites  (field  of  injury,  106  genes).  Figure  2  shows  the  field 
cancerization  effect  in  which  cancer-associated  changes  in  gene  expression  in  the  proximal  airway  fall  off  as 
distance  from  the  tumor  increases.  This  pattern  is  not 
observed  in  patients  without  cancer.  The  second  pattern  of 
gene  expression  (genes  consistently  changed  across  the 
thoracic  airway  depending  on  the  presence  or  absence  of  a 
tumor)  are  significantly  enriched  by  GSEA  among  genes  we 
previously  found  to  be  altered  in  the  large  airway  of  patients 
with  cancer  (pval  <  0.001,  Figure  3).  In  contrast,  genes  that 
change  in  a  gradient  like  manner  (field  of  cancerization,  47 
genes)  were  not  enriched  in  that  previously  published  large 
airway  dataset.  These  findings  suggest  that  the  genes  we 
identified  changing  consistent  throughout  the  airway  (field  of 
injury,  106  genes)  hold  the  potential  to  serve  as  early 
detection  biomarkers  for  lung  cancer  in  the  relatively 
accessible  proximal  airway. 


Brushingsr 

A  =  Nose 
B  =  T  rachea 
C  =  Contralateral  MS 
D  =  MS 

E  -  Small  airway 

Tissue' 

F'  Tumor/Nodlule 
Gr  Adjacent  Worm  a  I 


Figure  1.  Overview  of  biospecimen  coiiection  sites. 


5 


Army  Award  W81 XWH-1 0-1  -1 006 

Comprehensive  Final  Report:  Reporting  Period  20  Sept  2010  -  19  Sept  2014 


=  tumor  /  benign  nodule 
■=  small  airways  near  tumor/nodule 
•=  ipsilateral  main  bronchus 
■=  contralateral  main  bronchus 
■=  trachea 
■=  nose 


Small  airways  Large  airways 
Near  tumor/nodule 


No  Cancer 


Figure  2.  Gene  expression  changes  associated 
with  the  “Fieid  of  Cancerization  Effect”.  47  genes 
shown  are  p  <  0.005  for  the  interaction  term  and  p  < 
0.005  for  the  cancer  term. 


Figure  3.  Genes  consistentiy  changing 
throughout  the  fieid  of  injury  are  abie  to 
distinguish  iarge  airway  epitheiium  from 
smokers  with  and  without  iung  cancer 
in  an  independent  dataset  [1].  Heatmap 
showing  expression  levels  for  leading  edge 
genes  from  our  “field  of  injury”  signature  in 
bronchial  airway  samples  collected  from 
smokers  undergoing  bronchoscopy  for 
suspicion  of  lung  cancer.  Black  =  patients 
diagnosed  with  cancer,  Pink  =  patients 
diagnosed  with  benign  lung  disease.  Red  = 
increased  expression,  Blue=decreased 
expression. 


Additionally,  we  have  performed  RNA-seq  on  a  subset  of  the  specimens  from  subjects  (n  =  17)  run  on  arrays  in 
order  to  both  confirm  the  gene  expression  signatures  identified  by  microarrays  as  well  as  identify  additional 
transcripts  within  the  field  of  injury  not  interrogated  by  the  microarrays. 

B.  Gene  expression  analysis  of  the  field  cancerization  in  early-stage  NSCLC  patients 

Spatiotemporal  field  of  cancerization  expression  profiles. 


Gene  expression  alterations  in  response  to  cigarette  smoke 
have  been  characterized  in  normal-appearing  bronchial 
epithelium  of  healthy  smokers,  and  it  has  been  suggested  that 
adjacent  histologically  normal  tissue  displays  tumor- 
associated  molecular  abnormalities  [10,  11].  We  sought  to 
delineate  how  the  field  of  injury  evolves  in  space  and  in  time  in 
NSCLC  patients  following  curative  surgery.  In  Years  01-02  of 
this  program,  we  performed  expression  profiling  of  samples 
from  smoker  patients  who  were  accrued  into  a  surveillance 
clinical  trial  for  annual  follow-up  and  bronchoscopies  within  1 
year  after  definitive  surgery.  Bronchial  brushings  and  biopsies 
were  obtained  from  six  different  sites  in  the  lung  at  the  time  of 
inclusion  in  the  study  and  at  12,  24,  and  36  months  after  the  first  time  point  (Figure  4)  [12].  Our  study 
pinpointed  spatial  and  temporal  cancer-associated  expression  alterations  in  the  molecular  field  of  injury  of 
patients  with  early-stage  NSCLCs  after  definitive  surgery  that  enrich  the  molecular  definition  of  the  airway  field 
of  cancerization  and  establish  new  paradigms  for  the  patient  at  risk  for  lung  cancer  [12,  13]  (see  Reportable 
Outcomes  and  Appendix). 

Transcriptomic  architecture  of  the  adjacent  airway  fieid  of  cancerization. 


rarlna  AM  C 

Analysis  of  SITE-dependent 

CONTF^ 

y\m4ff*ON.ADJ 

gene  expression  patterns  (ADJ, 

Right  NjV) 

(Mm/ 

NON-ADJ,  MC,  CONTRA) 

Upper  Lobes^ 
Right 

Lobe 

Middle 

Global  gene  expression 

analysis 

LowUlie-^® 

^  ‘•eft 

lA  '^/fvv^^LowefLobe 

Analysis  of  TIME-dependent 

j 

Resectela  it 

umor* — ^ 

gene  expression  patterns 

(baseline,  12,  24  and  36 

months. 

Figure  4.  Schematic  illustrating  sampling  and  collection  of 

bronchial  brushes  from  different  airway  sites  at  different  time 

points  following  resective  surgery  from  early  stage  smoker 

NSCLC  patients  enrolled  in  the  phase  II  cancer  surveillance 

Vanguard  study. 
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We  also  sought  to  characterize  the  yet  unknown  global 
molecular  and  adjacent  airway  field  cancerization  in 
early-stage  NSCLC.  In  Years  01-03  of  this  program, 
we  performed  whole-transcriptome  expression  profiling 
of  resected  early-stage  (l-IIIA)  NSCLC  specimens 
(n=20)  with  matched  tumors,  multiple  cytologically 
controlled  normal  airways  with  varying  distances  from 
tumors  and  uninvolved  normal  lung  tissues  (n=194 
samples)  using  the  Affymetrix  Human  Gene  1.0  ST 
platform.  A  schematic  of  the  recently  published  [14] 
study’s  design  is  represented  in  Figure  5.  We  identified 
differentially  expressed  gene  features  (n  =  1661) 
between  NSCLCs  and  airways  compared  with  normal 
lung  tissues,  a  subset  of  which  (n  =  299),  after  gene  set 
enrichment  analysis,  statistically  significantly  (P  <  .001) 
distinguished  large  airways  in  lung  cancer  patients 
from  airways  in  cancer-free  smokers.  In  addition,  we  identified  genes  (n  =  422)  statistically  significantly  and 
progressively  differentially  expressed  in  airways  by  distance  from  tumors  that  were  found  to  be  congruently 
modulated  between  NSCLCs  and  normal  lung  tissues.  Furthermore,  LAPTM4B,  with  statistically  significantly 
increased  expression  (P  <  .05)  in  airways  with  shorter  distance  from  tumors,  was  up-regulated  in  human 
immortalized  cells  compared  with  normal  bronchial  epithelial  cells  (P  <  .001)  and  promoted  anchorage- 
dependent  and  -independent  lung  cancer  cell  growth.  Our  studies  demonstrate  that  the  adjacent  airway  field  of 
cancerization  comprises  both  site-independent  profiles  as  well  as  gradient  and  localized  airway  expression 
patterns  [14].  Profiling  of  the  airway  field  of  cancerization  may  provide  new  insights  into  NSCLC  oncogenesis 
and  molecular  tools  for  detection  of  the  disease  [141  (see  Reportable  Outcomes  and  Appendix). 

Our  microarray  analysis  also  pinpointed  to  a  decreased  expression  of  the  lung-specific  tumor  suppressor  gene 
G-protein  coupled  receptor  family  C,  group  5,  member  A  {GPRC5A)  in  tumors  and  airways  compared  to 
uninvolved  normal  lung  tissue.  GPRC5A  was  identified  as  a  lung-specific  tumor  suppressor  gene  evidenced  by 
spontaneous  and  tobacco-driven  adenocarcinoma  formation  in  mice  with  knockout  of  both  of  the  gene’s  alleles 
[15-19].  GPRC5A  was  shown  to  exert  its  tumor  suppressive  function,  in  part,  by  inhibition  of  nuclear  factor- 
kappa  B  (NFkB)  and  downstream  inflammation  [16].  In  Year  02  of  this  program,  we  analyzed  GPRC5A 
expression  in  the  molecular  field  cancerization  associated  with  chronic  obstructive  pulmonary  disease  (CORD), 
a  risk  factor  for  lung  cancer  that  is  typically  associated  with  inflammation  [20].  Quantitative  real-time  PCR 
(QRTPCR)  in  an  independent  set  of  samples  demonstrated  that  GPRC5A  expression  was  significantly 
decreased  in  the  molecular  localized  field  cancerization  [20].  GPRC5A  airway  expression  was  highest  in 
airways  from  cancer-  and  COPD-free  smokers,  decreased  in  airways  of  COPD  patients  (p  =  0.004)  and  lowest 
in  airway  epithelia  of  COPD  patients  with  adenocarcinoma  and  SCC  (P  <  0.0001)  [20]  (see  Reportable 
Outcomes  and  Appendix). 

Somatic  point  mutations  and  allelic  imbalance  in  the  airway  field  of  cancerization. 
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Our  findings  on  the  transcriptomic  architecture  of  the  airway  field 
of  cancerization  in  this  DoD-funded  program  prompted  us  to 
perform  preliminary  studies  in  Year  04  to  investigate  somatic 
genomic  alterations  in  the  airway  field.  In  a  pilot  study,  we 
performed  deep  sequencing  of  a  panel  of  409  cancer-associated 
genes  using  the  Ion  Proton  sequencing  platform  (Life 
Technologies),  in  samples  from  several  lung  adenocarcinoma 
(LUAD)  patients  consisting  of  multi-region  tumor  core  needle 
biopsies  (CNBs),  normal-appearing  small  airways  (SA)  adjacent 
to  the  LUADs,  normal  large  airways  (LA),  nasal  epithelia  and 
white  blood  cells.  We  achieved  an  average  depth  of  900X  and 
80%  of  the  samples  achieved  >500X  across  80%  of  the  capture 
region.  We  applied  the  Genome  Analysis  Toolkit  (GATK)-based 
workflow  [21],  running  MuTect  [22]  to  infer  acquired  mutations 
with  the  white  blood  cell-derived  DNA  as  the  paired  normal 
sample.  We  obtained  counts  of  somatic  variants  for  each  non¬ 
blood  sample.  We  found  that  there  were  some  mutations  were 
shared  between  the  tumor  and  adjacent  “normal"  airway 

samples,  while  others  were  unique  (private)  to  these  regions. 

There  appeared  greater  sharing  between  the  adjacent  “normal” 
airway  and  LUAD  than  between  the  tumor  and  either  the  large 
airway  (LA)  or  nasal  epithelium.  Figure  6A  depicts  mutations  in 
fibroblast  growth  factor  receptor  4  {FGFR4,  red)  and  TATA  box 
binding  protein-associated  factor  (TAFfL;  blue)  that  were  found 
to  be  shared  between  LUAD  CNBs  and  adjacent  “normal”  small 
airways  (SA).  The  variant  alleles  for  the  FGFR4  mutations  were 
found  in  decreasing  frequency  from  some  LUAD  CNBs  to  the 
SA,  LA,  nasal  (and  was  not  observed  in  the  blood),  fitting  the  profile  of  a  spatial  field  cancerization  mutation 
(Figure  6A;  pt.  A).  We  also  performed  a  pilot  genome-wide  SNP  array  profiling  study  and  used  hapLOH,  a 
computational  tool  developed  by  the  Scheet  laboratory  [23-25],  at  MD  Anderson  Cancer  Center,  that 
incorporates  information  about  germline  haplotypes  and  allows  detection  of  aberrant  DNA  particularly  in 
settings  when  there  exists  clonal  mosaicism  in  adjacent  normal  tissue,  which  we  anticipate  in  the  field 
cancerization.  Using  hapLOH  we  identified  somatic  allelic  imbalance  events  in  the  airway  field  cancerization. 
Figure  6B  displays  an  event  in  chromosomal  region  9q  that  is  shared  between  the  tumor  and  the  adjacent 
normal-appearing  airway  in  two  LUAD  patients.  This  event  was  not  detected  in  white  blood  cells  (Figure  6)  or 
in  uninvolved  and  distant  normal  lung 
tissues.  Our  findings  pinpoint  somatic 
genomic  aberrations  in  the  normal¬ 

appearing  airway  field  of  cancerization. 

These  findings  were  submitted  as  an 
abstract  for  presentation  in  the  forthcoming 
American  Association  for  Cancer 
Research  Annual  meeting. 


Role  of  LAPTM4B  field  cancerization 
marker  in  lung  cancer  pathogenesis. 

One  of  the  top  aberrant  gradient  and 
localized  field  cancerization  markers  from 
our  recent  study  [14]  was  the  lysosomal 
membrane  associated  putative  oncogene 
LAPTM4B.  LAPTM4B  was  recently  shown 
to  promote  autophagy  for  breast  cancer 
cell  survival  and  to  mediate  breast  cancer 
chemoresistance  [26,  27].  In  Years  03-04 
of  this  program,  we  sought  to  study  the 
role  of  LAPTM4B  in  lung  cancer 
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Figure  6.  Genomic  aiterations  in  the  airway 
fieid  cancerization.  A.  Somatic  point  mutations 
icdentifiecd  by  (deep  sequencing  antd  present  in  both 
NSCLCs  an(d  acdjacent  normal  airways.  B. 
Icdentification  by  hapLOH  (reed  vertical  lines)  of 
tumor-associatecd  somatic  chromosomal 
aberrations  in  “normal”  airways  of  two  patients 
(left  an(d  right).  Black  arrows  antd  asterisks 
intdicate  allelic  imbalance  in  9q  fountd  in  both 
NSCLCs  (top  )  an(d  airways  (mitdcdle)  antd  not 
fountd  in  bloocd  (bottom).  CNBs,  core  neecdle 
biopsies;  SA,  small  acdjacent  airway;  LA,  large 
airway. 
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pathogenesis.  As  detailed  in  our  previous  annual  report  (Year  03),  we  already  demonstrated  that  1) 
knockdown  of  LAPTM4B  suppressed  anchorage-dependent  and  -independent  lung  cancer  cell  growth;  2) 
LAPTM4B  expression,  when  analyzed  by  in  situ  hybridization  in  NSCLC  tissues,  was  predictive  of  poor  survival 
and  prognosis;  3)  the  anti-growth  effects  of  LAPTM4B  knockdown  were  substantially  larger  when  cells  were 
cultured  in  medium  lacking  serum  compared  to  when  cells  were  cultured  in  serum-containing  medium;  4) 
LAPTM4B  activates  cellular  autophagy  to  promote  lung  cancer  cell  survival  following  serum  starvation  and  5) 
LAPTM4B  activated  and  up-regulated  the  nuclear  factor  erythroid  2-like  2  (NFE2L2  also  known  as  NRF2) 
transcription  factor  along  with  the  NRF2-mediated  response  as  evidenced  by  functional  pathways  analysis  of 
expression  profiles  of  cells  with  and  without  knockdown  of  LAPTM4B. 


In  Year  04  of  the  program,  we  further 
explored  the  interplay  between  LAPTM4B 
and  the  NRF2-mediated  stress  response. 

QRTPCR  analysis  of  the  NRF2  target 
heme  oxygenase  1  {HMOX1) 
demonstrated  that  mRNA  levels  of  the 
NRF2  target  HMOX1  were  significantly 
increased  after  48  h  and  72  h  of  serum 
withdrawal.  Knockdown  of  LAPTM4B 
significantly  attenuated  HMOX1  induction 
by  serum  withdrawal  (Figure  7).  In 
addition,  knockdown  of  LAPTM4B 
concomitantly  reduced  the  expression 
levels  of  the  NRF2  transcription  factor 
itself. 

We  then  sought  to  assess  the  effect  of 
LAPTM4B  expression  on  the  NRF2 
transcription  factor  itself.  We  first 
performed  chromatin  immunoprecipitation 

(Chip)  analysis  of  the  effect  of  LaPTM4B  knockdown  on  the  transactivation  of  HMOX1  by  the  NRF2 
transcription  factor.  Using  primers  covering  NRF2-binding  sites  in  the  HMOX1  promoter,  ChIP  demonstrated 
that  LAPTM4B  knockdown  decreased  serum  starvation-induced  transactivation  of  HMOX1  by  NRF2  evidenced 
by  reduced  binding  of  the  transcription  factor 
to  the  HMOX1  promoter  (Figure  8A). 

Moreover  and  in  accordance  with  QRTPCR 
and  Chip  analyses,  knockdown  of  LAPTM4B 
attenuated  nuclear  accumulation  of  NRF2 
protein  by  serum  starvation  as  evidenced  by 
western  blotting  of  nuclear  cell  fractions 
(Figure  8B)  and  by  ICC  analysis  of  the  co¬ 
localization  of  NRF2  with  the  nuclear  marker 
histone  2B  (Figure  8C). 

Furthermore,  reciprocal  effects  were 
observed  in  cells  transfected  with  LAPTM4B- 
overexpressing  vectors.  Over-expression  of 
LAPTM4B  significantly  augmented  HMOX1 
and  NRF2  induction  by  serum  starvation 
(Figures  9A-C)  as  well  as  the  nuclear 
accumulation  of  NRF2  (Figure  9D).  It  is 
noteworthy  that  while  modulation  of 
LAPTM4B  expression  did  not  affect  levels  of 
HMOX1  in  cells  at  basal  conditions  and 
cultured  in  FBS-containing  medium  (Figure 
9C),  LAPTM4B  expression  positively 
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Figure  9.  Over-expression  of  LAPTIVI4B  increases  NRF2 
activation  and  HMOX1  expression.  QRTPCR  analysis  of 
LAPTM4B  (A),  NFE2L2  (B)  and  HMOX1  (C)  mRNA  levels  in 
Calu6  lung  cancer  cells  following  LAPTM4B  over-expression  in 
the  presence  and  absence  of  10%  FBS.  D.  Western  blot  analysis 
of  nuclear  and  total  NRF2  protein  in  Calu6  lung  cancer  cells 
following  LAPTM4B  over-expression  in  the  presence  and  absence 
of  10%  FBS. 


with  10%  FBS 


DAPI  Histone  H2B  NRF2 


□API  Histone  H2B  NRF2 


with  1 0% . 
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without  1 0% 
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Figure  8.  LAPTM4B  promotes  nuclear  localization  of  the  NRF2 
transcription  factor.  A.  ChIP  analysis  of  NRF2  protein  binding  to 
HMOX1  promoter  in  Calu6  lung  cancer  cells.  B.  Western  blot 
analysis  of  nuclear  and  total  NRF2  protein  following  LAPTM4B 
knockdown  in  presence  and  absence  of  10%  FBS.  C.  ICC  analysis 
of  NRF2  nuclear  localization  following  LAPTM4B  knockdown  in 
presence  and  absence  of  10%  FBS. 
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controlled  nuclear  levels  of  the  transcription  factor  NRF2  (Figure  9D).  It  is  reasonable  to  surmise  that  HMOX1 
induction  by  NRF2  is  independent  on  the  levels  of  the  latter  transcription  factor  but  rather  dependent  on  serum 
deprivation-induced  stress.  Our  findings  point  to  a  novel  intracellular  mechanism,  which  involves  the  LAPTM4B 
field  cancerization  marker,  for  control  of  the  NRF2  transcription  factor  during  basal  conditions  and  cellular 
stress  (e.g.  nutrient  deprivation).  These  findings  were  presented  in  the  past  2014  American  Association  for 
Cancer  Research  Annual  meeting  [28]  and  are  currently  being  prepared  for  publication. 

C.  Investigating  molecuiar  signature  of  iung  cancer  development  in  bronchial  specimens  by  shotgun 
proteomics. 

A  major  challenge  in  reducing  mortality  from  lung  cancer  hinges  on  identifying  at-risk  population.  The  bronchial 
epithelium  may  serve  as  surrogate  tissue  for  biomarker  of  risk.  Our  objective  was  to  identify  a  signature  of  risk 
from  proteomic  alterations  in  the  cytologically  normal  airway  epithelium  from  individuals  at  risk  for  developing 
lung  cancer.  We  have  identified  2792  protein,  the  largest  to  our  knowledge,  in  bronchial  brushings  specimens 
collected  from  risk  stratified  individuals  by  shotgun  proteomics.  Joncheere-Terpstra  trend  test  among  low, 
medium  and  high  risk  groups  resulted  significantly  altered  expression  of  316  proteins  (trend  p  <0.05)  with  238 
up  and  78  down  trends.  Pathway  enrichment  analysis  revealed  up-regulation  of  carbohydrate  metabolic 
pathways  in  the  airways  of  high  risk  individuals.  Expression  of  carbohydrate  metabolic  enzymes,  lactate 
production  and  glucose  consumption  were  increased  in  in  vitro  culture  of  human  bronchial  epithelial  cells 
treated  with  cigarette  smoke  condensate.  Up-regulation  of  these  enzymes  and  other  selected  candidate 
proteins  were  validated  by  parallel  reaction  monitoring  mass  spectrometry  and  tissue  microarray  of 
histologically  normal  bronchial  tissues  from  risk  stratified  individuals.  We  identified  a  signature  of  lung  cancer 
risk  assessment  that  may  provide  the  basis  of  patient  selection  for  surveillance  programs  and 
chemoprevention.  Pathway  enrichment  analysis  as  well  as  in  vitro  data  suggests  possible  metabolic 
reprogramming  in  the  airway  epithelium  of  high  risk  individuals. 

Methods 

Flere  we  hypothesized  that  proteomic  alterations  in  the  histologically  normal  airway  epithelium  from  risk 
stratified  individuals  allow  us  to  derive  a  signature  of  risk  of  developing  lung  cancer. 

Specimen  collection  procedure:  Bronchial  brushings  were  collected  from  patients  under  conscious  sedation. 
The  brushings  were  immediately  dipped  into  1.5  ml  saline  taken  in  a  labeled  eppendorf  tube.  The  tube  was 
kept  on  ice  to  minimize  protease  action.  Care  was  taken  to  keep  the  brush  specimen  free  from  blood. 
Brushings  in  the  saline  was  vigorously  agitated  by  vortexing  for  about  10  seconds  with  highest  speed.  It  was 
then  spun  1500g  for  10  minutes  in  a  microcentrfuge  with  the  brush  inside  the  tube.  Supernatant  was  removed 
carefully  leaving  as  little  saline  as 
possible  keeping  the  brush  inside  the 
tube.  The  pellet  was  stored  in  freezer 
at  -80°C  temperature.  Patients 
undergoing  autofluorescence 

bronchoscopy  for  clinical  suspicion  of 
lung  cancer  agreed  to  provide 
bronchial  biopsy  specimens  at 
predetermined  normal  sites  (with 
normal  fluorescence  ratio).  Biopsy 
specimens  collected  for  research  were 
snap  frozen  and  stored  in  -80°C 
freezer.  Patients  characteristics  are  in 
Table  2. 


Data  acquisition:  Proteins  were  extracted  and  fractionated  as  described  previously.  Equal  amounts 
(micrograms)  of  proteins  from  each  specimen  of  each  group  were  pooled,  divided  into  three  technical 
replicates  and  each  replicate  was  fractionated  by  isoelectric  focusing  (lEF).  Shotgun  proteomic  data  from  each 
of  the  fifteen  lEF  fractions  were  collected  by  LC  MS/MS  (Velos).  A  combination  of  database  search  tools  were 
employed  to  identify  peptide  sequence  present  in  a  protein/protein  groups.  Normalized  spectral  counts  of  the 
digested  proteins  were  considered  as  the  basis  of  protein  abundance  measurement.  Protein  groups  were 
identified  by  peptide  matching,  database  search,  filtration  and  parsimonious  analyses  using  search  tools 


Table2.  Characteristics  of  individuals  providing  bronchial  brushings  for  the  training  set 


Risk  levels 

Characteristics 

low  (n=5) 

Medium  (n=5) 

High  (n=5) 

Age 

Average+stdev 

58.2+4.7 

59.4+10.7 

72+5.2 

Median  (range) 

58  (53-65) 

59  (47-74) 

69  (68-78) 

Gender 

Male 

4 

2 

3 

Female 

1 

3 

2 

Smoking  status 

Never  smoker 

5 

0 

0 

Ex-smoker 

0 

5 

4 

Current  smoker 

0 

0 

1 

Pack  year,  avg+stdev 

n/a 

21.5+24.1 

62.8+25.4 

Pack  year,  median  (range) 

n/a 

20  (1-60) 

50  (50-108) 
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developed  at  Vanderbilt  University  Medical  Center  (18-21).  A  protein  group  may  include  isoforms,  variants  or  in 
rare  occasions,  unrelated  proteins  containing  identical  peptide  sequence.  Lisa  and  Rob  to  write  the  detail  about 
sample  preparation  for  LC  MS/MS  analysis.  lEF  fractionation  of  peptide  digests  and  LC  MS/MS  data 
acquisition  and  analysis.  Peptide  identification  from  LC  MS/MS  data,  protein  assembly  and  filtering. 

Statistical  Methods:  Joncheere-Terpstra  Trend  analysis  Ming  to  write  (groups  1,  2  and  3  Trend  analysis). 
Validation  of  selected  identified  proteins  by  parallel  reactions  monitoring  (PRM)  analysis.  Overexpressed 
carbohydrate  metabolic  enzymes  in  biopsies  collected  from  individuals  who  provided  bronchial  brushings  for 
the  training  set  by  Western  blotting  using  matched  biopsies. 

Results 

We  have  identified  2792  protein  groups  in  the  bronchial 
brushings  specimen  of  risk  stratified  individuals  by  LC 
MS/MS  shotgun  proteomics  (Table  3).  Volunteers/subjects 
were  categorized  as  low,  medium  and  high  risk  groups 
using  the  lung  cancer  screening  tool.  Among  the  identified 
proteins  in  each  group  50  to  60%  were  identified  in  each  of 
the  three  replicates.  Each  protein  was  identified  by  two  or 
more  peptide  sequences.  Number  of  protein  groups 
identified  separately  in  groups  1,  2  and  3  were  2116,  1949 
and  2771  respectively.  Reproducibility  of  protein 
identifications  in  all  three  replicates  were  49%,  48%  and 
61%  in  groups  1,  2  and  3  respectively.  To  discover  significantly  dysregulated  proteins  in  the  bronchial 
epithelium  of  Individuals  at  risk  for  lung  cancer  development  we  performed  Jonckheere-Terpstra  trend  test  with 
2526  identified  proteins.  Number  of  proteins  with  statistically  significant  (trend  p  <0.05)  increasing  and 
decreasing  trends  were  316.  Normalized  spectral  counts  were  used  as  quantitative  measure  of  protein 
abundance.  Expression  of  238  out  of  316  significantly  dysregulated  proteins  demonstrated  increasing  trend 
and  that  of  78  proteins  demonstrated  decreasing  trend. 

Pathway  enrichment  analysis  reveals  profound  alterations  in  metabolic  enzymes  regulating  the  glycolytic 
pathway  and  the  TCA  cycle.  To 
understand  the  biological  relevance 
of  these  alterations  in  the  bronchial 
epithelium  proteome  pathway 
analysis  was  performed  using 
Webgestalt  (WEB-based  GEne 
SeT  AnaLysis  Toolkit)  (22). 

Because  of  the  unavailability  of 
larger  human  bronchial  epithelium 
proteome  we  used  our  total 
identified  proteins  (2792  proteins) 
as  the  reference  proteome  of 
bronchial  epithelium  instead  of 
entire  human  proteome.  KEGG 
(Kyoto  Encyclopedia  of  Genes  and 
Genomes)  pathway  analysis  with 
316  dysregulated  proteins  revealed 
dysregulation  of  metabolic 

pathways.  Interestingly,  enzymes  of 
the  glycolytic  pathway,  TCA  cycle, 
pentose  phosphate  pathway, 
galactose,  and  glycogen 

metabolisms  were  overexpressed 
indicating  very  early  events  of 
possible  metabolic  reprogramming 
in  the  histologically  normal 


Ganglioside 
B  Iglbi 
Galactose 

jCALKl 


D 

Glucose 


I  HKI,  HK3 

PYGB  ,  PGMi,  PGM2*  GgPD  6-phosphoglucono- 

Glycogen — ►  Glucose  1-P  Glucose-6-P  — ►  ,  ^ 

»  ,  It  lactone 

V  UGP2  iiGPi  I 

GYG,GYS^  \  I  RGN 

UDP-glucose  Fructose-6-P  ^  , 

ItPFKP.PFKL  6-phosphogkKonate 

Ribulose-5-P 

JtALDOA,  ALDOC 

Dihydroxyacetone-P  ^11^  Dihydroxyacetone-P 

TPI  TPI 

Glyceraldehyde-3-P 

II  GAPDH 

1,3-biphospihoglycerate 

IIpgki 

3-phosphoglycerate 

II  PGAM5,  BPGM 

2-phosphoglycerate 

II  ENOl,  EN02,  ENOS 

Phosphoenolpyruvate 
LDHA,  LDHB  1 

Lactose  Pyruvate  — Acetyl-CoA 


FigurelO-  Glycogenolysis  (A),  galactose  metabolism 
(B),  glycolysis  (C),  pentose  phosphate  pathway  (D), 
and  TCA  cycle  (E)  enzymes  were  upregulated  (red)  in 
the  bronchial  brushings  of  individuals  at  risk  for  lung 
cancer  development. 
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/MDH 

Malate 


Citrate 

AC02\^ 
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Table  3:  Number  of  identified  protein  groups  and 
spectral  counts  of  each  risk  group 


Risk 

Group 

Filtered 

Spectra 

Distinct 

Peptides 

Protein 

Groups 

All  groups 

2792 

Low 

24637 

5812 

2116 

Medium 

24530 

5115 

1949 

High 

36598 

9579 

2771 
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bronchial  epithelium  of  individuals  at  risk  for  lung  cancer  development  (Figure  10).  Moreover,  six  out  of 
fourteen  enzymes  of  the  above  mentioned  pathways  are  rate  limiting  enzymes.  Results  of  the  Jonckheere- 
Terpstra  trend  test  showing  statistically  significant  increasing  trend  of  the  overexpressed  carbohydrate 
metabolic  enzymes  among  the  risk  groups  using  normalized  spectral  count  of  the  replicates  of  each  risk  group 
are  show  in  Figure  11. 

Overexpression  of  ME2  may  indicate  diversion  of  excess  TCA  cycle  intermediate  malate  to  fatty  acid 
biosynthetic  pathway  via  pyruvate  and  acetyl 
coA.  Among  the  316  significantly  altered 
proteins  there  were  49  metabolic  enzymes.  As 
expected  enzymes  of  lipid  metabolic  pathways, 
fatty  acid  synthase  (FASN),  Acyl-CoA 
synthetase  family  member  3  (ACSF3)  and 
arachidonate  15-lipoxygenase  were  also  found 
to  be  overexpressed.  Overexpression  of  alcohol 
and  aldehyde  metabolic  enzymes  ALDH1A1, 

ALDH3A1  and  AKR1B10  in  lung  tumor  tissues 
was  reported  previously  by  our  group  and  other 
investigators  and  now  we  found  these  enzymes 
to  be  overexpressed  in  the  histologically  normal 
bronchial  epithelium  of  individuals  at  risk  for  lung 
cancer  development. 

Pathway  analysis  revealed  metabolism  as  the  top  pathway  to  be  dysregulated  in  individuals  at  risk  for  lung 
cancer  development.  The  identification  of  ACSF3,  ALOX15  and  FASN  in  the  bronchial  epithelium  of  at  risk 
individuals  raises  the  possibility  of  involvement  of  lipid  metabolism  in  early  events  of  lung  tumorigenesis. 
Previously  we  reported  overexpression  of  acyl-coA  binding  protein/Diazepam  binding  inhibitor  (ACBP/DBI)  in 
lung  cancer  tissues  indicating  perturbation  of  lipid  metabolism  in  lung  cancer.  To  our  knowledge, 
overexpression  of  the  proteins  identified  and  validated  in  the  current  study  in  the  bronchial  epithelium  of 
individuals  at  high  risk  for  developing  lung  cancer  was  not  reported  before. 

Validation  of  over  expression  of  carbohydrate  metabolic  enzymes  and  selected  candidate  proteins  by  PRM  and 
immunohistochemistry  in  bronchial  the  epthelium.  These  experiments  are  in  progress. 

Warburg  effect  in  the  human  bronchial  epithelial  cell  lines  by  cigarette  smoke  condensate  (CSC). 

Given  the  overexpression  of  glycolytic  enzymes  including  LDH  we  were  curious  to  know  if  the  level  of  lactate 
were  altered  in  the  airway  epithelium  of  high  risk  individuals.  As  an  alternative  risk  model  we  treated  normal 
bronchial  epithelial  cell  lines,  16HBE  and  BEAS2B  with  CSC  in  vitro  and  measured  lactate  and  glucose 
concentrations  in  culture  supernatant  after  24  hours.  Significantly  enhanced  lactate  production  and  glucose 
consumption  in  vitro  as  a  result  of  treatment  of  the  cells  with  CSC  (Figure  11)  suggests  a  possible  role  of 
carbohydrate  metabolic  enzymes  in  lung  cancer  development.  Whether  these  metabolic  changes  have  any 
causative  effect  on  the  transformation  of  normal  bronchial  epithelial  cells  to  dysplastic  and  finally  malignant 
cells  as  Otto  Warburg  postulated  more  than  half  century  ago  remains  to  be  elucidated. 

Specific  Aim  2:  Evaiuate  the  roie  of  airway  epitheiium  tumor-initiating  stem/progenitor  celis  in  current 
and  former  smokers. 

Summary  of  Research  Findings: 

A.  Assessment  of  the  molecular  profiles  of  tumor-initlating  stem/progenitor  cells  from  normal  airway 
epithelium,  premaiignant  iesions  and  cancer. 

During  the  fourth  year  of  funding,  the  work  performed  over  the  last  3  years  and  described  in  the  previous 
progress  report  was  published  [29].  Briefly,  we  published  an  analysis  of  the  transcriptomic  changes  that  were 
most  relevant  for  progression  of  premaiignant  lesions  to  squamous  lung  cancer.  We  next  explored  one  of  the 
pathways  that  we  found  to  be  altered  with  initiation  and  progression  of  squamous  cell  carcinoma  (SCC)  within 
individual  patients.  This  pathway  is  the  reactive  oxygen  species  pathway.  Cigarette  smoke  contains  high  levels 


Figureii.lncreased  lactate  production  and  glucose  consumption  by  human  bronchial 
epithelial  cell  line,  BEAS2B  after  treatment  with  CSC  for  24  hours  in  vitro. 
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of  reactive  oxygen  species,  and  this  is  therefore 
directly  relevant  to  lung  injury  and  repair  in  current 
and  former  smokers. 

B.  Role  of  reactive  oxygen  species  in  lung 
carcinogenesis. 

We  found  that  reactive  oxygen  species  (ROS)  in 
airway  basal  stem  cells  are  usually  maintained  at 
low  levels  by  antioxidants  in  the  cell,  but  after 
injury  there  is  flux  of  ROS  from  low  levels  to  a 
higher  level  (“sweet  spot”)  and  it  is  this  flux  that 
allows  the  stem  cell  to  self-renew  for  repair.  The 
higher  ROS  levels  are  rapidly  reduced  in  the  stem 
cells  to  achieve  homeostasis  and  prevent  the 
basal  stem  cells  from  proliferating  too  much  and 
producing  hyperplasia  and  dysplasia  in  the  form  of 
a  premalignant  lesion.  We  further  showed  that  this 
increase  in  ROS  in  basal  stem  cells  in  the  injured 
airway  results  in  an  increase  in  Nrf2  expression 
which  acts  through  Notch  1  to  promote  the 
proliferation  of  airway  basal  stem  cells.  Of  note,  the 
premalignant  lesions  that  occur  when  the 
ROS/Nrf2/Notch1  pathway  is  perturbed  are  able  to 
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Figure  12.  Role  of  activated  Notchl  in  the  development  of 
premalignant  lesions  of  the  large  airways.  Repair  of  the  airway 
epithelium  was  assessed  after  polidocanol  injury.  For  Notchl 
activation,  K5-NICD  mice  received  intratracheal  RU486  to  turn  on 
the  NICD  transgene  in  K5-expressing  basal  cells.  For  Notchl 
inhibition  the  gamma  secretase  inhibitor,  DBZ,  was  administered.  At 
48  hours  post  injury  the  DBZ  treated  mice  had  delayed  repair  of  the 
airway  epithelium  with  reduced  proliferation,  while  the  K5-NICD  mice 
demonstrated  excessive  proliferation  of  basal  cells  with  loss  of 
differentiation  resulting  in  the  hyperplasia  and  dysplasia  of 
premalignant  lesions. 


spontaneously  resolve,  just  as  most  premalignant 
lesions  resolve  in  CORD  patients.  Thus,  it  is  likely  that  additional  genomic  events  are  responsible  for 
persistence  of  rare  premalignant  lesions  that  can  then  progress  to  invasive  lung  cancer  (Figure  12).  This  work 
was  published  in  [30]. 


C.  Assessment  of  other  pathways  identified  by 
gene  expression  profiling  of  premalignant 
lesions  and  invasive  squamous  lung  cancers. 

The  data  from  the  gene  expression  profiling  of  the 
laser  captured  airway  basal  stem  cells, 
premalignant  lesions  and  invasive  squamous  lung 
cancer  cells  suggests  that  there  are  multiple 
metabolic  pathways  that  are  important  as  the 
basal  stem  cells  undergo  hyperplasia  and 
dysplasia  and  then  transfer  into  invasive  cancer 
cells  (Figure  13).  We  are  currently  assessing 
these  pathways  and  determining  whether  they 
could  be  successfully  targeted  in  a 
chemoprevention  strategy. 


Figure  13.  Ingenuity  pathway  analysis  of  the  P53  and  MYC 
pathway  gene  expression  differences  between  premalignant 
lesions  and  invasive  squamous  lung  cancer.  The  P53  pathway  is 
predicted  to  be  inhibited,  while  the  MYC  pathway  is  expected  to  be 
activated. 


D.  Analysis  of  the  mutational  landscapes  of  lung  premalignancies. 

Uncertainty  about  the  clinical  behavior  of  a  premalignant  lesion  can  lead  to  either  inappropriate  inaction  or 
inappropriate  aggressive  treatment,  either  of  which  can  result  in  harm  to  the  patient.  Therefore,  in  the 
framework  of  this  DOD  funding,  we  initiated  the  assessment  of  the  role  of  somatic  mutations  that  occur  within 
the  pulmonary  field  of  cancerization,  in  early  pathogenesis  of  lung  cancer.  To  this  end,  we  first  identified  areas 
of  ADC  tumor,  adenocarcinoma  in  situ  (AIS,  if  present),  AAH  and  histologically  normal  regions  in  FFPE  tissue 
blocks  from  patients  who  underwent  surgical  resection  of  lung  adenocarcinoma.  We  then  used  LCM  to  retrieve 
cells  from  each  of  these  regions  within  the  same  subject  (in  these  preliminary  studies,  2-4  areas  of  each  tissue 
type  were  isolated  and  sequenced  for  each  subject).  DNA  was  then  extracted  from  these  tissues  and  the 
sequencing  libraries  were  constructed  and  exome  enriched.  The  libraries  were  sequenced  on  lllumina 
HiSeq2000  instrument  as  lOObp  paired-end  runs.  Most  FFPE  samples  provided  high-quality  sequencing  reads 
and  we  obtained  >20x  —  >60x  per  base  mean  coverage.  Sequencing  was  repeated  for  the  samples  that 
provided  lower  than  mean  30x  coverage. 
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Next,  the  sequencing  data  was  analyzed  for  the 
presence  of  somatic  mutations.  A  variant  was  defined  as 
somatic  mutation  when  p  <  0.1  for  the  Fisher  test 
comparing  the  number  of  the  reference  and  variant  reads 
in  the  normal  and  abnormal  (AAH,  AIS  or  ADC)  samples. 
Within  each  case,  position-specific  missense  and 
nonsense  mutations  were  compared.  Different  cases 
were  compared  for  the  mutations  at  gene-specific  level. 
We  define  mutations  found  only  in  AAH  as  premalignant, 
in  ADC  as  malignant  and  in  AAH,  AIS  and  ADC  as 
progression-associated  mutations.  We  identified  367 
point  mutations  affecting  amino  acid  sequences  of  362 
proteins  in  more  than  one  abnormal  sample.  Only  two 
cases  out  of  9  sequenced  in  the  pilot  experiment,  were 
found  to  bear  oncogenic  KRAS  mutations  in  ADC;  in  one 
of  these  cases  the  same  mutation  (at  lower  read  count) 
was  also  detected  in  the  AAH  lesion,  in  another  case  all 
AAH  lesions  had  the  wild  type  KRAS.  In  other  sequenced 
cases,  mutations  in  ERBB2  and  FGFR  oncogenes  were 
detected  in  ADC  but  not  in  AAH  or  AIS  samples.  The 
data  was  compared  to  the  Catalogue  of  Somatic 
Mutations  in  Cancer  (COSMIC)  database.  Among 
affected  proteins,  only  3%  (i.e.  11  genes)  overlapped 
with  the  COSMIC  cancer  census  list  (Figure  14). 
Interestingly,  all  of  them  were  identified  as  common 
mutations  in  AAH  and  at  least  one  ADC  sample. 
Additionally,  we  compared  the  recurrently 
mutations  identified  in  our  experiments  with  The 
Cancer  Genome  Atlas  (TCGA)  database  (Figure 
15).  The  analysis  demonstrated  that  the  majority  of 
the  identified  mutations  were  novel,  i.e.  not 
included  in  COSMIC  database. 


Further  analysis  demonstrated  that  approximately 
100  genes  bore  the  progression-associated 
mutations  that  were  common  in  at  least  2  out  of  9 
cases  sequenced  initially.  Figure  16  demonstrates 
33  genes  that  were  found  to  bear  progression- 
associated  mutations,  common  in  4  out  of  9  cases 
{PDE4DIP  and  MYH9  are  included  in  COSMIC 
census  list).  Interestingly,  recurring  mutations  were 
found  in  genes  involved  in  calcium  signaling  and 
extracellular  matrix/receptor  interaction  (data  not 
shown).  These  findings  demonstrating  the  role  of 
altered  calcium  signaling  and  cell  adhesion  in  early 
malignant  transformation,  are  novel  and  have  not 
been  reported  before.  Furthermore,  pathways 
affected  by  the  mutated  genes  were  identified 
utilizing  Gene  Ontology  and  pathways  from  the 
KEGG,  Biocarta  or  Reactome  databases  (Figure 
17).  The  observation  that  few  genes  mutated  in 
both  AAH  and  ADC  are  known  as  key  drivers 
based  on  either  COSMIC  or  TCGA  data,  indicates 
that:  a)  progression  associated  mutations  might 
facilitate  the  malignant  transformation  by  mutated 


I  AAH/ADC/Census 
I AAH/ADC/N  on- Census 
■  ADC/Non-Census 


Figure  14.  Common  point  mutations  identified  among 
subciones  in  the  same  patient.  The  blue  area  incjicates 
mutations  fountd  in  AAH,  AIS  and  ADC  within  the  same 
patient  and  not  inclucdecj  in  COSMIC  cancer  census  list;  recj 
area  —  mutations  common  in  AAH  and  ADC  and  inclucdecd  in 
COSMIC;  pink  area  —  mutations  unique  to  ADC  and  not 
inclucdecj  in  COSMIC. 


Figure  15.  Comparison  of 
the  recurrentiy  mutated 
genes  with  TCGA.  Malignant 
high:  genes  mutatecd  in  4  out 
of  9  patients,  low  —  mutatecd 
in  2-3  out  of  9  patients. 
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Figure  16.  Genes  bearing  recurrent  progression-associatecd 
mutations  in  4  out  of  9  cases. 


Figure  17.  Aberrant  pathways  enriched  by  progression- 
associated  and  maiignant  mutants. 


14 


Army  Award  W81 XWH-1 0-1  -1 006 

Comprehensive  Final  Report:  Reporting  Period  20  Sept  2010  -  19  Sept  2014 

key  driver(s),  or  b)  a  combination  of  two  or  more  progression  associated  mutations  that  are  not  oncogenic 
alone,  might  drive  malignant  transformation.  These  hypotheses  will  be  further  tested  by  mapping  progression 
and  malignant  associated  genes  into  pathway  context. 

These  new  data  are  critically  important,  given  the  fact  that  patients  at  risk  for  lung  cancer  may  have  subclinical 
disease  for  years  prior  to  presentation  and  diagnosis.  Moreover,  the  proteins  encoded  by  some  of  mutated 
genes  have  recently  proven  to  be  tractable  targets  for  new  anticancer  drug  development  [31-34].  We  anticipate 
that  our  results  will  allow  for  the  development  of  chemopreventive  and/or  therapeutic  strategies  that  are 
effective  during  the  window  of  opportunity  when  the  disease  is  still  subclinical.  Future  translation  of  these 
results  could  lead  to  a  paradigm  shift  in  the  way  in  which  we  address  the  clinical  problem  of  the  individual  at 
high  risk  for  lung  cancer. 


E.  Assessment  of  the  molecular  profiles  of  tumor-initiating  stem/progenitor  ceils  from  normal  airway 
epithelium,  premalignant  lesions  and  cancer. 

In  collaboration  with  the  Wistuba 


laboratory,  the  Massion  laboratory 
has  collected  bronchial  brushings 
from  individuals  at  risk  or  with  small 
cell  lung  cancer.  We  have  collected 
on  13  individuals  with  SCLC,  isolated 
total  RNA  and  were  able  to  obtain 
quality  RNA  on  1 1  of  them  as  shown 
in  Table  4.  The  RNA  is  part  of  a  study 
to  derive  a  signature  for  SCLC  risk  in 
the  field  of  cancerization  and  to 
attempt  to  identify  candidate  tumor 
initiating  progenitor  stem  cells  in 
SCLC. 


Table  4.  Bronchial  brushings  from  individuals  diagnosed  with  SCLC  yet 
untreated. 


project  2967  # 

Sample 

Barcode 

RIN 

28s:18s 

cone  (ng/ul) 

yield  (ug) 

SCLC 

95 

RNA 

522214 

7.9 

1.3 

260 

3.9 

X 

86 

RNA 

519951 

7.6 

1.1 

28 

0.42 

X 

182 

RNA 

554960 

7.6 

1.3 

283 

4.245 

X 

177 

RNA 

515207 

7.1 

2.1 

92 

1.38 

X 

180 

RNA 

539943 

7 

1.9 

52 

0.78 

X 

68 

RNA 

524855 

6.9 

0.9 

18 

0.27 

X 

74 

RNA 

524354 

6.7 

0.9 

247 

3.705 

X 

181 

RNA 

532271 

6.3 

2 

41 

0.615 

X 

179 

RNA 

555886 

2.4 

2.1 

68 

1.02 

X 

178 

RNA 

522995 

2.3 

1.7 

130 

1.95 

X 

F.  To  increase  our  understanding  of  the  roie  of  tumor- 
initiating  stem/progenitor  cells  in  the  pathogenesis  of 
lung  cancer  in  the  “field  cancerization”  that  develops  in 
current  and  former  smokers. 

RNA  sequencing  of  laser  captured  samples  to  identify 
genes  whose  expression  is  associated  with  different 
pathologic  stages  of  iung  SCC  carcinogenesis  (Aim  2B). 

The  first  year  of  this  work  involved  troubleshooting  to  refine 
methods  for  laser  capture  of  normal  basal  cells  (BC), 
premalignant  metaplastic/dysplastic  cells,  and  squamous 
cell  carcinoma  (SCC)  tumor  cells  from  frozen  sections  and 
subsequent  RNA  isolation.  RNA  from  one  subject  was 
hybridized  to  microarrays,  and  the  expression  of  control 
genes  behaved  as  expected,  indicating  that  it  was  feasible  to 
obtain  meaningful  results  from  this  material.  RNA  was  then 
isolated  from  several  more  subjects  and  successfully  used  to 
perform  high-throughput  sequencing  (RNA-seq).  To  identify 
genes  associated  with  SCC  progression,  genes  with  low 
expression  were  removed,  two  different  statistical  models 
were  used  to  assign  significance  to  each  gene  with  respect 
to  each  pairwise  comparison,  and  the  results  were 
intersected  to  identify  genes  whose  expression  changed 
significantly  at  an  early  (normal  to  premalignant)  or  late 
(premalignant  to  tumor)  stage  or  at  both  stages  (in  a 
stepwise  manner).  The  expression  of  these  genes  in  all 
samples  from  all  four  patients  is  illustrated  in  Figure  18. 


Patient  1  Patient  2  Patient  3  Patient  4 


I  I  Normal  airway  Premalignant  lesion  Tumor 

Figure  18.  Heatmap  of  expression  of  eariy,  iate, 
and  stepwise  genes.  Retd  antd  blue  incdicate 
expression  that  is  higher  or  lower  than  the  mean  within 
each  patient,  respectively.  Genes  are  hierarchically 
clusteretd  within  each  group  (early,  stepwise,  late). 
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Experimental  validation  of  candidate  genes  with  quantitative  real-time  PCR  and  immunofluorescence 
staining  (Aim  2B).  Three  genes  {CEACAM5,  SLC2A1  and  PTBP3)  with  potential  roles  in  lung  cancer  biology 
that  were  up-regulated  in  premalignant  lesions  and  tumor  cells  were  selected  for  further  validation.  qPCR 
confirmed  the  upregulation  of  CEACAM5  and  SLC2A1  in  premalignant  cells,  and  immunofluorescent  staining 
showed  that  all  three  genes  were  undetectable  in  normal  BC  but  highly  expressed  in  metaplastic  and  SCC 
cells.  Additionally,  the  Gene  Expression  Omnibus  (GEO)  Profiles  tool  was  used  to  examine  their  expression  in 
GEO  DataSets  relevant  to  lung  SCC  carcinogenesis.  SLC2A1  and  PTBP3  were  significantly  up-regulated  in 
SCC  tumors  (GDS1312)  [35]  with  respect  to  paired  adjacent  normal  tissue  and  in  a  set  of  SCC  lung  tumors 
relative  to  lung  adenocarcinomas  (GDS3627)  [36],  and  CEACAM5  and  SLC2A1  were  significantly  up-regulated 
in  bronchoscopic  brushings  from  healthy  current  smokers  relative  to  healthy  never  smokers  (GDS534)  [37]. 

Computational  identification  and  experimentai  vaiidation  of  dysreguiated  biologicai  mechanisms 
associated  with  progression  (Aim  2B).  Gene  Set  Enrichment  Analysis  (GSEA)  [38]  performed  using 
positionally  defined  gene  sets  revealed  that  the  transition  from  premalignant  to  tumor  is  associated  with 
coordinate  loss  of  expression  in  3p  and  gain  of  expression  in  3q26.33-3q29,  corresponding  to  previous  reports 
[39-41].  Ingenuity  Pathway  Analysis  (IPA)  was  then  used  to  show  that  the  early  and  late  stages  were  marked 
by  cell  cycle  progression,  and  cellular  migration  and  transformation,  respectively,  and  both  stages  were 
marked  by  an  increase  in  cell  proliferation  and  decrease  in  cell  death.  IPA  was  also  used  to  show  that  genes 
that  are  differentially  expressed  early  in  carcinogenesis  and  remain  dysreguiated  in  tumor  cells  are  enriched  in 
genes  previously  reported  to  be  up-regulated  by  MYC  and/or  down-regulated  by  TP53.  Immunofluorescent 
staining  was  used  to  confirm  that  MYC  was  localized  exclusively  to  the  nuclei  of  premalignant  lesions  and 
tumor  cells,  but  not  in  those  of  normal  BC.  More  information  about  this  work  may  be  found  in  a  recent 
publication  in  Cancer  Prevention  Research  [29]. 

Specific  Aim  3:  Test  airway-based  mRNA  and  microRNA  biomarkers  of  diagnosing  iung  cancer  in 

current  and  former  smokers  at  high  risk  for  lung  cancer  in  minimaliy  invasive  sites. 
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Mapping  the  airway-wide  moiecuiar  fieid  of  injury  in  smokers  with  iung  cancer. 

In  Years  03  and  04,  we  studied  the  molecular  spatial  map  of  field  effects  that  transverse  the  normal-appearing 
bronchus  adjacent  to  tumors  up  to  the  relatively  distant  nasal  epithelium.  We  surmised  that  this  analysis  would 
aid  in  identification  of  shared  genomic  changes  between  the  field  and  lung  cancer  and  that  extend  to 
compartments  (e.g.  nasal)  in  the  field 
cancerization  that  can  be  readily  accessible 
for  biomarker  analysis  in  screening  and 
clinical  settings.  Samples  (n=254)  from 
patients  with  (n=28)  and  without  (n=9)  lung 
cancer  that  were  collected  from  all  partnering 
institutions  (14  cancer  cases  from  MD 
Anderson)  were  processed  for  global 
expression  profiling  at  MD  Anderson  Cancer 
center  using  the  Human  Gene  2.0  ST 
platform  (Affymetrix)  and  data  analysis  was 
performed  in  collaboration  with  BU 
(Partnering  PI,  Dr.  Avrum  Spira)  (please  see 
report  by  Dr.  Spira).  We  identified  profiles 
differentially  expressed  between  NSCLCs 
and  benign  nodules.  47  genes  were  identified 
to  be  differentially  expressed  in  a  gradient 
like  manner  in  the  field  of  injury,  i.e.  genes 
that  increase  or  decrease  in  expression  with 
increasing  distance  from  the  primary  lung 
tumor  (Figure  19,  left  heat  map).  Gene 
expression  analysis  also  identified  106  genes 
to  be  significantly  differentially  expressed  in 
the  airways  of  NSCLC  patients  compared  to 


Figure  19.  Airway-wide  gene  expression  patterns  in  the  fieid  of 
injury.  Gene  expression  profiling  itjentifiecj  two  (distinct  patterns  of  gene 
expression  in  the  airway  fielcd  of  injury:  47  genes  itdentifiecd  to  be 
(differentially  expresse(d  in  a  gra(dient  manner  in  the  fiel(d  of  injury  of 
NSCLC  patients  and  not  in  patients  with  benign  (disease  (left  heatmap) 
and  106  genes  that  were  foun(d  to  be  (differentially  expresse(d  between 
NSCLC  patients  and  smokers  with  benign  (disease  and  that  were 
persistent  throuqhout  the  airwav  fiel(d  of  iniurv. 
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smokers  with  benign  disease  and  that  were  persistently  expressed  (non-gradient)  throughout  the  airway  field  of 
injury  (Figure  19,  right  heatmap).  Importantly,  these  gradient  and  persistent  genes  were  validated  in 
independent  samples  and  cohorts  (please  see  report  by  Dr.  Avrum  Spira).  These  data  point  to  field  of  injury 
profiles  that  embody  the  nearby  lung  tumor  and  inform  of  lung  cancer  pathogenesis  (gradient  field  of  injury) 
and  field  markers  that  are  more  likely  to  be  diagnostic  of  lung  cancer  in  minimally  invasive  sites  in  the  clinical 
setting  (persistent  genes).  These  findings  have  been  presented  by  Kusko  et  al.  in  the  past  2014  American 
Association  for  Cancer  Research  annual  meeting  [42]  and  are  currently  being  prepared  as  a  manuscript  for 
publication. 

4.  KEY  RESEARCH  ACCOMPLISHMENTS 
Year  01 

•  Identified  that  gene  expression  is  modulated  in  a  site-  and  a  time-dependent  manner  in  the  bronchial 
epithelium  of  early  stage  lung  cancer  patients. 

•  Identified  several  pathways  preferentially  activated  in  the  airway  adjacent  to  tumors  in  patients  with  lung 
cancer,  including  those  mediated  by  RISK,  NF-kB  and  ERK1/2. 

•  Completed  the  collection  and  field  cancerization  gene  expression  analysis  of  23  patients  (n=226  samples) 
with  lung  tumors  using  samples  obtained  from  lobectomy  specimens. 

Year  02 

•  Identified  aberrant  activation  of  canonical  oncogenes  in  the  molecular  field  of  injury  of  early  stage  NSCLC 
patients  including  phosphorylated  AKT  and  ERK  kinases. 

•  Completed  the  analysis  and  characterization  of  the  molecular  localized  field  cancerization  using  194 
samples  comprised  of  tumor,  normal  lung  and  airway  samples  from  20  NSCLC  cases. 

•  Derived  field  cancerization  expression  signatures,  comprised  of  genes  concordantly  and  significantly 
differentially  expressed  between  tumors  and  airways  compared  to  matched  normal  lung  tissue,  and 
pertinent  to  NSCLC,  lung  adenocarcinomas  and  SCCs. 

•  In  collaboration  with  BU  (Partnering  institution),  demonstrated  that  molecular  profiles  in  the  localized  field 
cancerization  are,  at  least  in  part,  relevant  to  the  molecular  field  of  injury  and  contain  markers  significantly 
and  concordantly  different  between  airways  of  patients  with  and  without  lung  cancer 

•  Identified  profiles  that  were  significantly  progressively  and  differentially  expressed  by  distance  from 
corresponding  lung  tumors  and  concordantly  modulated  between  tumors  and  paired  uninvolved  normal 
lung  tissues,  pinpointing  to  their  probable  roles  in  pathogenesis. 

•  Confirmed  the  differential  expression  of  several  site-dependent  and  -independent  markers  identified  by 
microarray  analysis  of  the  field  cancerization  at  the  RNA  and  protein  level  by  QRTPCR  and 
immunohistochemical  analysis,  respectively. 

•  In  collaboration  with  the  Partnering  Pis  and  Initiating  PI  of  this  award,  began  to  analyze  by  RNA-Seq  at  BU 
(20  cases,  156  samples)  and  comprehensive  microarray  profiling  at  MD  Anderson  (28  cases,  183  samples) 
the  molecular  field  of  injury  in  NSCLC  patients  and  cancer-fee  individuals. 

Year  03 

•  Characterized  the  transcriptomic  architecture  of  the  adjacent  airway  field  cancerization  in  early-stage  non¬ 
small  cell  lung  cancer.  These  analyses  demonstrated  that  the  adjacent  airway  field  of  cancerization  is 
comprised  of  markers  that  can  identify  lung  cancer  among  smokers  as  well  as  gradient  and  localized  site- 
dependent  expression  patterns  that  recapitulate  NSCLC  profiles.  These  findings  have  been  submitted 
recently  for  publication  and  are  under  revision. 

•  Demonstrated  for  the  first  time  that  the  field  cancerization  putative  oncogene,  LAPTM4B,  is  a  positive 
mediator  of  the  lung  cancer  cell  malignant  phenotype  evidenced  by  its  promotion  of  anchorage- 
independent  colony  formation  in  soft  agar. 

•  Studied  the  mRNA  expression  of  LAPTM4B  in  a  large  series  of  NSCLC  histological  tissue  specimens  for 
the  first  time  by  in  situ  hybridization.  This  analysis  revealed  that  LAPTM4B  expression  is  significantly 
positively  associated  with  smoking  and  worse  overall  survival. 

•  Demonstrated  that  the  field  cancerization  marker  LAPTM4B  protects  lung  cancer  cells  from  serum 
deprivation-induced  growth  inhibition  and  promotes  the  autophagy  response  following  serum  deprivation. 

•  Revealed  that  LAPTM4B  is  a  novel  positive  regulator  of  NRF2  transcription  factor  in  lung  cancer  cells. 
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•  In  collaboration  with  the  Partnering  Pis  and  Initiating  PI  of  this  grant,  performed  microarray  profiling  at  MD 
Anderson  of  254  field  cancerization  samples  from  28  cases  with  lung  cancer  and  9  cases  with  benign 
disease  to  begin  to  characterize  the  molecular  spatial  map  of  field  effects  that  transverse  the  bronchus 
adjacent  to  tumors  up  to  the  nasal  epithelium.  This  novel  analysis  demonstrated  that  the  molecular  map  of 
the  field  of  injury/cancerization,  in  patients  with  lung  cancer,  is  comprised  of  pathways  and  gene  sets, 
whose  enrichment  in  the  field  decreases  with  larger  distance  from  the  tumor  as  well  as  those  that  persist 
up  to  the  nasal  epithelium. 

•  Identified  specific  pathways  that  are  involved  in  stepwise  progression  to  SCC. 

•  Found  that  metabolic  changes  in  the  airway  basal  stem  cells  are  critical  for  stepwise  progression 

Year  04 

•  Began  to  understand  genomic  aberrations  such  as  point  mutations  (i.e.  single  nucleotide  polymorphisms, 
small  insertions  and  deletions)  and  copy  number  alterations  (i.e.  amplification,  deletion,  loss-of- 
heterozygosity,  allelic  imbalance)  in  the  airway  field  cancerization  using  deep  sequencing  and  SNP  array 
analyses.  These  findings  in  the  last  year  of  the  DoD  grant  is  the  foundation  for  current  and  future  studies  to 
investigate  the  genomic  architecture  of  the  airway  field  of  cancerization  in  lung  cancer. 

•  Completed  series  of  in  vitro  experiments  demonstrating  the  interplay  between  LAPTM4B  and  NRF2  in 
mediating  stress  responses  that  are  vital  for  the  survival  of  lung  cancer  cells. 

•  In  collaboration  with  the  Partnering  Pis  and  Initiating  PI  of  this  grant,  completed  microarray  analysis  of  254 
field  cancerization  samples  from  28  cases  with  lung  cancer  and  9  cases  with  benign  disease  to 
characterize  in  depth  the  molecular  field  of  injury  and  identify  field  of  injury  markers  that  inform  of  lung 
cancer  pathogenesis  and  that  are  diagnostic  of  lung  cancer  in  minimally  invasive  sites. 

•  Compared  the  mutational  landscapes  of  the  primary  lung  tumors,  premalignant  lesions  and  adjacent 
normal  tissues,  and  identified  novel  genes  associated  with  malignant  transformation. 

•  Found  that  reactive  oxygen  species  metabolism  plays  a  vital  role  in  homeostasis  of  the  airway  basal  stem 
cells  and  that  perturbing  this  homeostatic  mechanism  results  in  premalignant  lesions. 

•  Found  that  Nrf2  and  Notch  1  act  downstream  of  ROS  in  this  homeostatic  regulation  that  prevents  excessive 
self-renewal  and  promotes  mucociliary  differentiation. 

5.  CONCLUSION 

During  this  DOD  program,  we  1)  understood  how  the  molecular  field  of  injury  evolves  spatiotemporally 
following  curative  surgery  in  smoker  early-stage  NSCLC  patients;  2)  characterized  the  transcriptomic 
architecture  of  the  adjacent  airway  field  cancerization  in  early-stage  NSCLC;  3)  identified  gradient  profiles  in 
the  localized  field  cancerization  that  highly  embody  the  nearby  lung  tumors;  4)  demonstrated  that  the  molecular 
adjacent  field  cancerization  extends  to  relatively  less  invasive  large  airways  and  harbors  markers  that  can 
detect  lung  cancer  in  smokers  that  are  suspect  of  the  malignancy;  5)  validated  the  expression  of  a  novel  field 
cancerization  marker,  LAPTM4B,  in  airways  and  tumors  in  NSCLC  patients;  6)  revealed  a  novel  interplay 
between  LAPTM4B  and  the  NRF2  transcription  in  mediating  the  NRF2-stress  response  and  pathway  for 
ensuing  lung  cancer  cell  growth  and  survival;  7)  began  to  understand  genomic  aberrations  (mutations  and 
copy  number  alterations)  in  the  airway  field  of  cancerization  and  8)  identified  field  of  injury/cancerization 
markers  that  are  specific  to  NSCLC  vs  benign  disease,  inform  of  NSCLC  pathogenesis  and  can  detect  lung 
cancer  when  assessed  in  minimally  invasive  sites  in  the  lung. 

Also,  our  work  has  provided  most  comprehensive  understanding  to  date  of  progression  of  airway  basal  stem 
cells  to  premalignant  lesions  and  squamous  lung  carcinogenesis.  The  pathways  identified,  such  as  the  ROS 
pathway,  provide  valuable  insight  into  the  biology  of  stepwise  progression  to  SCC  in  current  and  former 
smokers.  In  the  future  we  plan  to  identify  the  drivers  that  are  critical  in  taking  premalignant  lesions  to  the  point 
of  invasive  SCC.  This  will  hopefully  lead  to  the  development  of  novel  targeted  therapies  to  prevent  and  treat 
SCC. 

We  determined  that  bronchoscopy  specimens,  such  as  brushings,  can  be  used  successfully  for  shotgun 
proteomic  profiling.  Thus,  utilizing  bronchial  brushings,  we  successfully  demonstrated  by  shotgun  proteomics 
that  subjects  at  increased  risk  for  lung  cancer  have  a  characteristic  differential  protein  expression  signature. 
Based  on  these  findings,  we  identified  a  signature  of  lung  cancer  risk  assessment  biomarkers  that  may  provide 
the  basis  of  patient  selection  for  surveillance  programs  and  chemoprevention.  Pathway  enrichment  analysis  as 
well  as  in  vitro  data,  suggests  possible  metabolic  reprogramming  in  the  airway  epithelium  of  high  risk 
individuals. 
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7.  INVENTIONS,  PATENTS  AND  LICENSES 

Nothing  to  report. 

8.  REPORTABLE  OUTCOMES 

•  Pease  see  the  list  of  abstracts  and  publications  above  for  the  complete  list  of  reportable  outcomes. 

•  List  of  genes  differentially  expressed  during  stepwise  progression  to  SCC  deposited  in  GEO. 

•  Developed  a  protocol  for  massively  parallel  sequencing  of  RNA  isolated  from  premalignant  lesions  from 
FFPE  archived  tissues  by  laser  capture  microdissection. 

•  Developed  a  protocol  for  whole  exome  DNA  sequencing  of  cells  of  interest  in  lung  cancer  continuum,  laser 
capture  microdissected  from  FFPE  archived  tissues. 

•  Developed  a  protein  signature  that  is  able  to  differentiate  subjects  at  risk  for  lung  cancer  and  lung  cancer 
patients. 

9.  OTHER  ACHIEVEMENTS 

Aik  Ooi  received  employment  at  Fluidigm  company  based  on  his  research  experience  on  this  project. 
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Abstract 

Background;  The  identification  of  early  stage  non-small  cell  lung  cancer  (ES  NSCLC) 
patients  (pts)  at  higher  risk  for  recurrence  or  second  primary  tumor  (SPT)  development  is 
vital  to  personalizing  prevention  and  therapy.  We  sought  to  decipher  spatial  and  temporal 
patterns  of  gene  expression  in  the  airway  field  of  ever-smoker  ES  NSCLC  pts  to  better 
understand  lung  cancer  pathogenesis  and  predict  recurrence  or  SPT  development. 

Methods:  Pts  on  the  prospective  Vanguard  study  had  definitively  treated  ES  (I/ll)  NSCLC, 
were  current/former  smokers,  and  had  bronchoscopies  with  brushings  obtained  from  the 
main  carina  (MC)  at  baseline,  12,  and  24  months  following  resective  surgery  and  from 
different  anatomical  regions  at  baseline.  Expression  profiling  is  ongoing  for  all  eligible  pts 
(41  pts,  326  samples).  To  query  temporal  and  spatial  airway  expression  profiles,  two  sets 
of  six  pts  were  selected  based  on  complete  processed  time  point  and  baseline  airway  site 
(3  different  sites  per  pt)  arrays  (Affymetrix  Human  Gene  1.0  ST),  respectively.  Temporally 
and  spatially  differentially  expressed  genes  were  independently  identified  based  on  a 
p<0.01  of  a  univariate  t-test  with  estimation  of  the  false  discovery  rate  (FDR),  studied  by 
hierarchical  clustering  and  principal  component  analysis  (PCA),  and  functionally  analyzed 
using  network  analysis. 

Results:  871  gene  features  were  differentially  expressed  among  MCs  of  six  NSCLC  pts  at 
baseline,  12  and  24  months  and  were  shown  to  separately  group  the  MCs  as  evident  in 
both  cluster  and  PC  analyses.  Moreover,  pathways  analysis  of  the  temporally  modulated 
genes  showed  that  a  gene-network  mediated  by  extracellular  regulated  kinase  (ERK1/2) 
was  most  significantly  elevated  (p<0.001)  in  function  between  MCs  at  24  months  versus 
baseline.  763  and  931  gene  features  were  differentially  expressed  between  MCs  and 
adjacent-to-resected  tumors  (ADJ)  airways  and  between  MC,  ADJ  and  non-adjacent 
(distant-to-resected  tumor)  (NON-ADJ)  airways,  respectively.  Moreover,  pathways  analysis 
of  the  spatially  modulated  genes  revealed  that  gene-networks  mediated  by  nuclear  factor- 
kB  (NF-kB)  and  ERK1/2-mediated  were  most  significantly  elevated  (p<0.001)  in  function  in 
ADJ  airway  samples  versus  MCs.  Furthermore,  PCA  revealed  that  while  ADJ  airway 
samples  grouped  separately  and  closely  together,  one  MC  and  3  NON-ADJ  airway  samples 
resided  closely  with  ADJ  samples,  which  were  then  found  to  originate  from  3  pts  with 
evidence  of  recurrence,  SPT  or  suspicion  of  recurrence. 

Conclusions;  Our  findings  highlight  expression  signatures  and  pathways  (ERK1/2  and  NF- 
kB)  in  a  “cancerization  field”  that  may  drive  lung  cancer  pathogenesis  and  be  associated 
with  recurrence  or  SPT  development  in  ES  NSCLC  pts  and  thus  useful  for  derivation  of 
biomarkers  to  guide  personalized  prevention  strategies.  Supported  by  DoD  grants 
W81XWH-04-1-0142  and  \A/81XWH-1 0-1 -1007. 
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Abstract 

Detection  and  chemoprevention  of  non-small  cell  lung  cancer  (NSCLC)  have  been 
unsuccessful  in  part  due  to  our  limited  knowledge  of  the  pathogenesis  of  the  disease.  It 
has  been  suggested  that  normal  airway  epithelia  share  molecular  abnormalities  with 
tumors  and  may  serve  as  progenitors  for  lung  malignancies.  We  sought  to  analyze 
molecular  profiles  of  lung  adenocarcinomas  and  squamous  cell  carcinomas  (SCCs),  two 
major  subtypes  of  NSCLC,  and  matched  normal  airways  to  elucidate  aberrant  expression 
patterns  in  early  phases  of  lung  tumorigenesis.  All  specimens  were  obtained  from  primary 
early  stage  NSCLC  consented  patients  who  did  not  receive  neoadjuvant  therapy  (n=20). 
We  profiled  RNA  isolated  from  tumors  and  normal  lung  as  well  as  from  brushings  of 
multiple  matched  airways  that  were  histologically  confirmed  to  lack  neoplastic  or 
preneoplastic  cells  (n=194).  Expression  signatures  signifying  genes  significantly  and 
concurrently  differentially  expressed  between  both  tumors  and  airways  compared  to  normal 
lung  tissue  (tumor-airway-normal/TAN  signatures)  were  then  derived  independently  for 
see  and  smoker  adenocarcinoma  cases  and  were  comprised  of  1,803  and  1,938  genes, 
respectively.  The  TAN  signatures  effectively  clustered  tumor  and  airways  from  normal  lung 
samples  (p<0.001).  Further  analysis  showed  that  a  subset  of  the  genes  separated  SCC- 
and  adenocarcinoma-adjacent  airways.  Moreover,  pathways  and  gene-network  analysis 
using  Ingenuity  pathways  software  highlighted  similarities  and  differences  in  pathway 
modulation  between  airway  epithelial  fields  of  SCCs  and  adenocarcinomas.  Embryonic 
stem  cell  and  eicosanoid  signaling  pathways  were  most  significantly  modulated  among 
those  common  to  both  TAN  signatures  (p<0.001).  Retinoic  acid  receptor  and  stem  cell 
signaling  pathways  mediated  by  NANOG  and  lineage  oncogene  SOX2  were  most 
significantly  modulated  in  the  SCC  TAN  signature,  whereas  NF-kB  and  PTEN  signaling 
pathways  were  most  prevalent  in  the  adenocarcinoma  TAN  airway  signature  (all  p<0.001). 
Gene  networks  mediated  by  lineage  oncogene  NKX2-1/TITF-1  down-regulation  and  TP63 
up-regulation,  and  networks  mediated  by  increased  expression  of  the  MET  and  ERBB2 
oncogenes  were  predominantly  functionally  modulated  in  the  SCC  and  adenocarcinoma 
TAN  signatures,  respectively.  Quantitative  PCR  analysis  confirmed  up-regulation  of  MET  in 
adenocarcinomas  and  normal  airways  compared  to  normal  lung.  These  findings  highlight 
expression  patterns  and  pathways  that  are  deregulated  differentially  in  the  pathogenesis  of 
lung  adenocarcinomas  and  SCCs  and  thus  offer  detection  markers  as  well  as  therapeutic 
targets  to  guide  personalized  chemoprevention  in  early  stage  NSCLC  patients.  Supported 
by  DoD  grant  W81XWH-10-1-1007. 
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Abstract 

Earlier  work  has  identified  in  lung  cancer  a  field  cancerization  (FC)  phenomenon  in  which 
tumors  and  adjacent  normal  appearing  tissues  share  specific  molecular  abnormalities  (e.g., 
loss  of  heterozygosity)  that  may  be  highly  pertinent  to  cancer  pathogenesis.  We  sought  to 
characterize  the  global  molecular  airway  FC  adjacent  to  early-stage  non-small  cell  lung 
cancer  (NSCLC)  in  an  attempt  to  unravel  profiles  that  may  help  to  explain  the  development 
of  the  disease.  We  performed  whole-transcript  expression  profiling  of  a  set  of  resected 
early-stage  NSCLC  specimens  (n=20  patients)  with  matched  histologically  normal  airways 
of  varying  distance  from  the  tumor  and  paired  uninvolved  normal  lung  tissue  (n=194 
samples).  Using  linear  mixed-effects  models,  we  derived  FC  profiles  signifying  genes 
concordantly  differentially  expressed  between  tumors  and  airways  compared  to  normal  lung 
tissues.  Gene  set  enrichment  analysis  demonstrated  that  a  subset  of  the  genes  (n=299) 
was  significantly  and  congruently  modulated  between  large  airways  of  smokers  with  and 
without  lung  cancer.  We  then  questioned  whether  the  airway  FC  exhibits  site  from  tumor- 
dependent  expression  patterns. 

Ordinal  regression  analysis  identified  airway  profiles  (n=422  genes)  that  were  significantly 
progressively  expressed  by  distance  from  tumors  and  topologically  organized  into  canonical 
cancer-associated  pathways,  such  as  eukaryotic  initiation  factor,  p70S6K  kinase,  polo-like 
kinase  and  mammalian  target  of  rapamycin  signaling  (all  p<0.001).  In  addition,  the  site- 
dependent  airway  profiles  recapitulated  NSCLC  expression  patterns  and  were  concordantly 
modulated  between  tumors  and  uninvolved  normal  lung  tissues  pinpointing  their  probable 
roles  in  lung  cancer  pathogenesis.  Quantitative  real-time  PCR  (QRTPCR)  analysis 
confirmed  the  differential  expression  of  FC  markers  selected  by  both  pathways  analysis 
and  statistical  criteria.  Notably,  lysosome  associated  protein  transmembrane  4  beta 
(LAPTM4B),  a  putative  oncogene  with  no  known  role  in  lung  carcinogenesis,  was  among 
the  top  5  site-dependent  FC  markers  and  was  significantly  elevated  in  NSCLC  and 
immortalized  bronchial  epithelial  cell  lines  compared  to  normal  cells.  Furthermore,  transient 
or  stable  knockdown  of  LAPTM4B  by  RNA  interference  decreased 

NSCLC  cell  growth  as  well  as  anchorage-dependent  and  -independent  colony  formation.  In 
conclusion,  our  efforts  in  understanding  the  adjacent  molecular 

FC  in  NSCLC  unraveled  airway  profiles  that  1)  are,  in  part,  relevant  to  lung  cancer 
detection;  2)  are  modulated  by  distance  from  corresponding  tumors;  3)  recapitulate  NSCLC 
expression  patterns  and  4)  harbor  markers  engaged  in  mediating  the  lung  malignant 
phenotype.  Profiling  the  adjacent  airway  FC  in  conjunction  with  tumors,  may  provide 
additional  insights  into  the  molecular  pathology  of  NSCLC.  Funded  in  part  by  Department 
of  Defense  award  W81XWH-1 0-1 -1007. 
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Abstract 

Increasing  our  understanding  of  early  events  in  the  pathogenesis  of  lung  cancer  is  crucial 
for  identification  of  new  targets  for  prevention  and  treatment  of  this  malignancy.  Earlier 
work  has  shown  that  seemingly  normal  cells  adjacent  to  the  tumor  carry  specific  molecular 
alterations  that  are  characteristic  of  the  tumor  itself  suggestive  of  a  field  of  cancerization. 
By  sampling  and  studying  normal-appearing  tissue,  the  molecular  field  of  cancerization 
provides  biological  insights  into  early  phases  in  cancer  development.  In  this  study,  we 
sought  to  characterize  molecular  field  effects  in  the  normal-appearing  airway  that  are  most 
representative  of  the  nearby  lung  tumor,  and  thus,  are  most  likely  to  denote  early  events  in 
lung  carcinogenesis.  To  achieve  this,  we  performed  genome-wide  expression  profiling  of 
resected  field  cancerization  specimens  (n=20  patients)  comprised  of  matched  early-stage 
non-small  cell  lung  cancers  (NSCLCs),  cytologically  normal  airways  with  varying  spatial 
distance  from  the  tumors  and  distant  (relative  to  location  of  tumors)  normal  lung  tissues 
(n=194  samples).  Using  ordinal  logistic  regression,  we  identified  422  genes  that  were 
progressively  modulated  in  expression  in  normal-appearing  airways  by  spatial  distance  from 
tumors.  Notably,  when  examined  in  paired  NSCLC  and  normal  lung  tissues,  these  genes 
were  found  to  recapitulate  tumor  expression  profiles.  We  then  sought  to  examine  the  role 
of  lysosomal  protein  transmembrane  4  beta  (LAPTM4B),  a  putative  oncogene  that  was 
found  to  be  up-regulated  in  airways  by  shorter  spatial  distance  from  tumors,  in  lung 
oncogenesis.  LAPTM4B  was  significantly  elevated  in  NSCLC  tissues  compared  to  paired 
distant  normal  lung  and  was  predictive  of  poor  survival  in  lung  adenocarcinoma.  Moreover, 
LAPTM4B  promoted  anchorage-dependent  and  -independent  lung  cancer  cell  growth  and 
was  crucial  for  cellular  survival  and  the  autophagy  response  under  nutrient-  and  serum- 
deprived  conditions.  In  addition,  pathways  analysis  of  a  LAPTM4B-dependent  gene 
expression  profile  revealed  decreased  activation  of  the  canonical  nuclear  factor  erythroid  2- 
like  2  (NRF2)-mediated  pathway  following  LAPTM4B  knockdown.  Further,  we  found  that 
LAPTM4B  augmented  the  expression  and  nuclear  translocation  of  the  NRF2  transcription 
factor  following  serum  deprivation  pointing  to  the  probable  role  of  the  novel 
LAPTM4B/NRF2  signaling  axis  in  promoting  lung  cancer  cell  survival.  All  in  all,  our  study 
points  to  molecular  field  of  cancerization  profiles  in  the  normal-appearing  airway  that  highly 
signify  the  nearby  lung  tumor  and  comprise  early  mechanisms  (e.g.  LAPTM4B)  in  lung 
carcinogenesis. 
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Abstract 

Lung  cancer  mortality  is  the  leading  cause  of  cancer  death  in  the  United  States  in  part 
because  diagnosis  occurs  after  regional  or  distant  metastasis  of  the  disease.  Identifying 
effective  early  detection  biomarkers  is  crucial  for  improving  lung  cancer  clinical 
management.  Moreover,  molecular  biomarkers  for  early  disease  detection  may  provide 
insight  into  the  molecular  pathways  associated  with  disease  development  and  progression. 
Our  lab  has  shown  that  smoking-induced  gene  expression  alterations  are  mirrored  in  the 
epithelia  of  the  mainstem  bronchus,  buccal  and  nasal  cavity.  We  have  additionally 
demonstrated  that  gene-expression  profiles  in  cytologically  normal  mainstem  bronchial 
epithelium  can  serve  as  an  early  diagnostic  biomarker  for  lung  cancer.  Here  we  expand  on 
our  previous  work  by  spatially  mapping  the  molecular  field  of  injury  throughout  the  entire 
respiratory  tract  in  smokers  with  lung  cancer.  Using  Affymetrix  Gene  ST  2.0  arrays,  we 
profiled  genome-wide  gene-expression  in  1)  lung  lesions  and  adjacent  normal  lung 
obtained  from  smokers  undergoing  surgical  resection,  2)  epithelial  brushings  obtained  at 
intraoperative  bronchoscopy  from  the  nasal  epithelium,  main  carina  and  ipsilateral  and 
contralateral  proximal  and  distal  bronchi  (relative  to  the  location  of  the  resected  lung 
lesion),  and  3)  epithelial  brushings  obtained  at  lobectomy  from  sub-segmental  bronchus 
(adjacent  to  tumor).  Linear  modeling  approaches  comparing  the  airways  and  tumors  of 
patients  with  cancer  to  those  with  benign  lung  disease  were  used  to  explore  relationships  in 
cancer-specific  gene-expression  alterations  across  sites  within  the  respiratory  tract.  We 
found  that  genes  upregulated  in  the  small  airways  leading  to  the  tumor  were  enriched  in 
genes  upregulated  in  the  mainstem  bronchus  and  main  carina  of  smokers  with  lung  cancer. 
In  addition,  genes  upregulated  in  the  bronchus  and  main  carina  of  smokers  with  lung 
cancer  showed  enrichment  among  cancer  associated  genes  elevated  in  the  nose. 
Furthermore,  a  linear  mixed  effects  model  uncovered  genes  and  pathways  which  change  in 
expression  in  a  gradient-like  manner  as  distance  from  the  tumor  increases.  Our  findings 
suggest  that  the  molecular  field  of  injury  encompasses  airway-wide  alterations  throughout 
the  entire  respiratory  tract  of  smokers  with  lung  cancer  as  well  as  gradient  profiles  that 
change  with  respect  to  proximity  of  the  nearby  tumor.  These  molecular  alterations  may 
ultimately  serve  as  early  detection  biomarkers  for  lung  cancer  and  provide  new  insights  into 
early  stages  of  lung  carcinogenesis. 
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ABSTRACT 

Lung  cancer,  of  which  non- small- cell  lung  cancer  com¬ 
prises  the  majority,  is  the  leading  cause  of  cancer- 
related  deaths  in  the  United  States  and  worldwide. 
Lung  adenocarcinomas  are  a  major  suhtype  of  non¬ 
small-cell  lung  cancers,  are  increasing  in  incidence  glo¬ 
bally  in  hoth  males  and  females  and  in  smokers  and 
non-smokers,  and  are  the  cause  for  almost  50%  of 
deaths  attrihutahle  to  lung  cancer.  Lung  adenocarci¬ 
noma  is  a  tumour  with  complex  biology  that  we  have 
recently  started  to  understand  with  the  advent  of 
various  histological,  transcriptomic,  genomic  and  pro- 
teomic  technologies.  However,  the  histological  and 
molecular  pathogenesis  of  this  malignancy  is  still 
largely  unknown.  This  review  will  describe  advances  in 
the  molecular  pathology  of  lung  adenocarcinoma  with 
emphasis  on  genomics  and  DNA  alterations  of  this 
disease.  Moreover,  the  review  will  discuss  recognized 
lung  adenocarcinoma  preneoplastic  lesions  and 
current  concepts  of  the  early  pathogenesis  and  pro¬ 
gression  of  the  disease.  We  will  also  portray  the  field 
cancerization  phenomenon  and  lineage- specific  onco¬ 
gene  expression  pattern  in  lung  cancer  and  how  both 
remerging  concepts  can  be  exploited  to  increase  our 
understanding  of  lung  adenocarcinoma  pathogenesis 
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for  subsequent  development  of  biomarkers  for  early 
detection  of  adenocarcinomas  and  possibly  personal¬ 
ized  prevention. 

Key  words:  airway  epithelium,  field  cancerization, 
genetics,  lung  cancer,  molecular  biology. 

INTRODUCTION 

Lung  cancer  is  the  leading  cause  of  cancer  deaths  in 
the  United  States  and  worldwide  in  both  developing 
and  developed  regions.^  The  high  mortality  of  this 
disease  is  in  part  due  to  the  late  diagnosis  of  the 
majority  of  lung  cancers  after  regional  or  distant 
spread  of  the  malignancy^  and  when  only  palliative 
treatment  options  are  available.^  Given  that  various 
epithelial  tumours  develop  in  a  multi-stage  stepwise 
fashion,  it  is  plausible  to  assume  that  early  diagnosis 
of  lung  cancer  or  intraepithelial  lesions  coupled  with 
effective  prevention  strategies  will  improve  survival  of 
patients  and  reduce  the  significant  health  burden  and 
mortality  associated  with  this  disease.^  Despite  recent 
encouraging  findings  from  the  National  Lung  Screen¬ 
ing  Trial  (NLST),^  early  detection  of  lung  cancer  is 
challenging  due  to  the  lack  of  biomarkers  for  early 
diagnosis  of  the  disease  and  to  the  presence  of  mul¬ 
tiple  neoplastic  molecular  pathways  that  mediate 
lung  carcinogenesis.  A  better  understanding  of  the 
molecular  origins  of  lung  cancer  is  expected  to  pave 
the  way  for  unmet  effective  and  personalized  strate¬ 
gies  for  lung  cancer  prevention  and  treatment. 

The  two  major  forms  of  lung  cancer  are  non-small- 
cell  lung  cancer  (NSCLC),  which  accounts  for 
approximately  85%  of  all  diagnosed  lung  cancers,  and 
small- cell  lung  cancer  (SCLC),  which  constitute  about 
15%  of  lung  neoplasms.^  NSCLC  is  comprised  of  three 
major  histological  subtypes,  squamous-cell  carcino¬ 
mas  (SCC),  lung  adenocarcinomas  and  large -cell  lung 
carcinomas.^’^  Several  major  differences  exist  between 
adenocarcinomas  and  SCC,  the  two  major  subtypes  of 
NSCLC.  Compared  with  SCC  and  SCLC  that  arise  from 
the  major  bronchi  and  are  centrally  located,  pulmo¬ 
nary  adenocarcinomas  arise  from  small  bronchi, 
bronchioles  or  alveolar  epithelial  cells,  and  are  typi¬ 
cally  peripherally  located  as  reviewed  elsewhere. 
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Clinically,  SCC  and  lung  adenocarcinoma  respond 
differently  to  chemotherapeutic  agents,  exemplified 
by  the  use  of  pemetrexed  for  treatment  of  the  latter 
subtype  and  not  for  SCC.^’^  Moreover,  although 
smoking  is  the  major  causative  factor  in  lung  cancer 
pathogenesis,  significant  differences  in  smoking  pat¬ 
terns  are  observed  between  the  two  major  NSCLC 
histological  subtypes.  Whereas  SCC  pathogenesis  is 
strongly  linked  to  smoking,  lung  adenocarcinoma  is 
the  more  common  histological  subtype  in  never- 
smoker  patients. Accumulating  evidence  suggests 
that  lung  adenocarcinoma  arising  in  never-smokers  is 
a  disease  with  different  pathological  and  epidemio¬ 
logical  features  compared  with  adenocarcinomas 
causally  linked  to  cigarette  smoking.^^  Specifically, 
never-smoker  lung  adenocarcinoma  is  more  com¬ 
monly  diagnosed  in  females  compared  with  males^^ 
and  is  more  frequently  found  in  eastern  and  southern 
parts  of  the  Asian  continent,^^  and  displays  better 
prognosis  and  survival  compared  with  ever-smoker 
patients.^’^^’^^  At  the  molecular  level,  and  to  date,  two 
major  pathways  are  thought  to  mediate  lung  adeno¬ 
carcinoma  development:  an  epidermal  growth 
factor  receptor  (EGFi?) -dependent  pathway  in 
never-smokers  and  a  Kirsten  rat  sarcoma  oncogene 
(K7MS) -dependent  signalling  module  in  smokers^®“^^ 
(discussed  further  later).  Further  understanding  of 
lung  adenocarcinoma  pathogenesis  would  be  needed 
to  unravel  other  pathways  that  play  important  roles 
in  development  of  this  major  subtype  of  lung  cancer. 

Lung  adenocarcinomas  have  a  wide  spectrum  of 
clinical,  molecular  and  histological  features.^^  The 
2004  World  Health  Organization  (WHO)  classification 
of  lung  tumours  included  four  growth  patterns  for  its 
adenocarcinoma  classification:  bronchioloalveolar 
(BAG;  also  known  as  lepidic),  acinar,  papillary  and 
solid.^^  Most  invasive  lung  adenocarcinomas  are 
heterogeneous  in  nature  and  include  more  than  one 
of  these  histological  patterns.^^’^^  The  existing  lung 
adenocarcinoma  histological  heterogeneity  and  the 
varying  clinicopathological  features  (e.g.  patient 
outcome)  of  the  aforementioned  histological  patterns 
highlight  the  importance  of  incorporating  histologi¬ 
cal  pattern  information  into  clinical  management  of 
this  complex  disease.  More  recently,  the  European 
Respiratory  Society  (ERS),  the  International  Associa¬ 
tion  for  the  Study  of  Lung  Cancer  (lASLC)  and  the 
American  Thoracic  Society  (ATS)  sponsored  a  new 
classification  of  lung  adenocarcinoma.^®  The  new 
classification  study  presented  several  modifications 
to  the  WHO  2004  criteria  for  diagnosis  of  resected 
adenocarcinoma  specimens.  Mainly,  the  consortium 
study  suggested  that  the  term  BAG  should  be  discon¬ 
tinued.^®  Instead,  it  is  agreed  that  adenocarcinoma  in 
situ  (AIS)  and  minimally  invasive  adenocarcinoma 
(MIA)  are  to  be  used  for  small  adenocarcinomas  with 
either  pure  lepidic  growth  or  predominant  lepidic 
growth  with  less  than  5  mm  invasion,  respectively.^® 
Moreover,  the  new  classification  dropped  the  use 
of  mixed  subtype,  and  instead,  adenocarcinomas 
are  classified  according  to  their  predominant 
subtype.^® 

This  review  will  describe  advances  in  the  molecular 
pathology  of  lung  adenocarcinoma  with  emphasis 
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on  genomics  and  DNA  alterations  of  this  disease. 
Moreover,  the  review  will  describe  recognized  lung 
adenocarcinoma  preneoplastic  lesions  and  current 
concepts  of  the  early  pathogenesis  and  progression  of 
the  disease.  We  will  also  portray  the  field  canceriza- 
tion  phenomenon  and  lineage-specific  oncogene 
expression  pattern  in  lung  cancer  and  how  both 
remerging  concepts  can  be  exploited  to  increase  our 
understanding  of  lung  adenocarcinoma  pathogenesis 
for  subsequent  development  of  biomarkers  for  early 
detection  of  adenocarcinomas  and  possibly  personal¬ 
ized  prevention. 


REVIEW 

Molecular  pathology  of  lung  adenocarcinoma 

Lung  adenocarcinomas  exhibit  unique  genomic  aber¬ 
rations  compared  with  lung  SGG,  indicating  that  the 
molecular  pathology  of  both  NSGLG  subtypes  encom¬ 
passes  different  molecular  pathways  of  development 
and  progression.^  Earlier  studies  have  shown  that 
lung  SGG  exhibit  higher  frequencies  of  deletions  at 
chromosomal  regions  17pl3  (TP53),  13ql4  (RB),  9p21 
{CDKN2A),  8p21-23  and  3p  compared  with  lung 
adenocarcinomas.^^“^®  Moreover,  many  of  the  afore¬ 
mentioned  molecular  abnormalities  (e.g.  allelic  losses 
at  9p21  and  13q24)  occur  in  the  sequential  multi-step 
progression  of  SGG  but  not  of  adenocarcinomas.®’^^  In 
contrast,  mutations  in  the  KRAS,  EGFR  and  HER2I 
NEU  oncogenes  occur  almost  exclusively  in  adeno- 
carcinomas.^’^®’^^’^®’®®’®^  Amplification  of  the  embryonic 
stem  cell  (ESG)  factor  sex  determining  Y-box  2  {SOX2) 
is  exclusive  to  SGG,®^’®®  and  increased  gene  dosage 
and  protein  expression  of  thyroid  transcriptional 
factor-1 /NK2  homeobox  1  {TITE-1/NKX2-1)  is  preva¬ 
lent  in  lung  adenocarcinoma,  indicating  that  both 
transcriptional  factors  most  likely  function  as  lineage - 
specific  genes  in  lung  cancer.®^  ®®  This  section  will 
highlight  molecular  abnormalities,  with  special 
emphasis  on  genomics  and  DNA  alterations,  of  lung 
adenocarcinoma  that  render  this  malignancy  a 
unique  entity. 


Lung  adenocarcinoma  genomics 

KRAS,  a  low  molecular  weight  guanosine  triphos¬ 
phatase  (GTPase)  and  the  major  upstream  activator  of 
the  RAE-MEK-ERK  pathway,  is  considered  to  be  the 
most  frequently  mutated  oncogene  in  lung  adenocar¬ 
cinomas.^®’®^’®^  As  mentioned  before,  mutations  in  this 
oncogene  are  more  common  in  adenocarcinomas 
arising  in  ever-smoker  (former  and  current)  lung 
cancer  patients. ^®’^®’^^’^®’®®’®®’®®’®®  Most  KRAS  mutations 
involve  replacing  glycine  12  with  other  amino  acids 
such  as  valine  (G12V),  aspartic  acid  (G12D)  and 
glutamic  acid  (G12D),  and  replacing  glycine  13,  and 
are  activating  rendering  the  gene  with  reduced 
GTPase  activity  with  subsequent  potent  activation  of 
mitogenic  and  proliferative  signalling  through  the 
RAE-MEK-ERK  cascade. ^®’®^“®®  Thus,  it  is  plausible  to 
assume  that  therapeutic  strategies  targeting  KRAS 
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would  be  very  beneficial  in  adenocarcinomas  with 
activating  mutations  in  this  oncogene.  However,  there 
are  currently  no  available  treatment  options  for 
KRAS-mutant  lung  adenocarcinomas  compared  with 
tumours  with  mutations  in  other  oncogenes, as 
strategies  targeting  KRAS  farnesylation,  MEK  activa¬ 
tion  and  BRAE  have  either  failed  or  yielded  no 
responses.^^“^^ 

In  contrast  to  KRAS,  mutations  in  EGER  are  strongly 
linked  to  lung  adenocarcinomas  arising  in  never- 
smokers  and  are  suggested  to  molecularly  drive  the 
disease  in  this  patient  subpopulation. It 
is  important  to  note  that  EGER  mutations  are  more 
common  in  East  Asian  patients  and  in  female  gen- 
der.2’13’22  Small  in-frame  deletions  in  exon  19  and  mis- 
sense  mutations  in  exon  21  (L858R  and  L861Q)  are  the 
most  common  mutations  detected  in  EGFR"^"^  and 
were  shown  by  several  ground-breaking  studies  to 
underlie  sensitivity  of  lung  adenocarcinoma  patients 
to  EGFR-targeting  small  tyrosine-kinase  inhibitors 
(e.g.  erlotinib  and  gefitinib).^^’^^’^®  These  studies 
were  the  first  to  prove  the  feasibility  of  personalized 
medicine  approaches  for  the  management  of  lung 
adenocarcinoma  and  represent  the  landmark  for 
the  application  of  genomic  medicine  in  this  disease. 

The  discovery  of  fusions  involving  anaplastic  large¬ 
cell  lymphoma  kinase  {ALK)  with  the  upstream 
partner  echinoderm  microtubule  associated  protein  4 
{EML4)  by  Soda  et  further  opened  new  venues  for 

genomic-driven  personalized  treatment  strategies 
for  lung  adenocarcinoma.^3  Both  EML4  and  ALK  are 
located  in  chromosome  2p,  and  fusion  of  both 
involves  small  inversions  within  this  region.^^  EML4- 
ALK  fusion  results  in  constitutive  activation  of  the 
ALK  kinase  rendering  cells  and  adenocarcinoma 
tumours  expressing  this  oncogenic  fusion  protein 
sensitive  to  ALK  inhibitors.^^“^®  Like  EGER  mutations, 
EML4-ALK  fusion  genes  are  prevalent  in  lung  adeno¬ 
carcinomas,  younger  patients  and,  in  particular,  in 
lifetime  never- smoker  patients  or  light  smokers. 
Importantly,  EML4-ALK  fusion  genes  are  mutually 
exclusive  from  EGER  and  KRAS  mutations,  indicating 
that  such  molecular  defects  function  as  drivers  of 
pathogenesis,  which  is  clinically  important,  as  it 
increases  potential  of  personalized  treatment  options 
that  target  driver  oncogenes  in  this  malignancy.^® 

Other  mutually  exclusive  and,  thus,  potential  onco¬ 
genic  drivers  have  been  identified  in  lung  adeno¬ 
carcinomas.  Mutations  in  HER2/NELf  were  found  by 
Stephens  et  al.  to  occur  in  lung  adenocarcinomas.^^ 
Compared  with  mutations  of  EGER  oncogene,  HER2/ 
NEU  mutations  are  less  frequenC®  and  have  not  been 
successfully  exploited  in  the  clinic  for  lung  adenocar¬ 
cinoma  treatment.3^  Similar  to  HER2/NEU,  mutations 
in  BRAE  also  occur  at  low  frequency  in  lung  adenocar¬ 
cinoma  and  are  exclusive  from  EGER  and  KRAS  muta¬ 
tions,  as  well  as  from  EML-ALK  fusions.^®  There  are 
yet  no  successful  target-specific  treatment  strategies 
for  lung  adenocarcinoma  with  BRAE  mutations.  It  is 
important  to  note  that  mutations  in  HER2/NEU  and 
BRAE  have  not  been  found  in  lung  SCC.^® 

The  TP53  tumour  suppressor  is  the  most  frequently 
mutated  gene  in  lung  adenocarcinoma  (65-70%). 
Various  abnormalities  in  TP53  were  identified  in  lung 
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adenocarcinoma  almost  two  decades  ago^^.^^  ^nd 
more  recently  in  the  tumour- sequencing  projecH^  and 
occur  in  similar  pathways  to  those  mediated  by  the 
oncogenic  driver  mutations  mentioned  earlier.2®’22’37 
Mutations  in  the  GDKN2A  tumour  suppressor  have 
also  been  described  in  lung  adenocarcinoma. 
However,  methylation^®  or  focal  DNA  deletion^^^s 
rather  than  mutation  of  this  tumour  suppressor 
seems  to  be  more  frequent  and  occurs  earlier  in  lung 
cancer  pathogenesis.^  With  the  advent  of  various 
technologies  including  single  nucleotide  polymor¬ 
phism  (SNP)  arrays,  mass  spectrometry  mutational 
analysis  and  more  recently  second-generation 
sequencing,  and  the  undertaking  of  large-scale 
studies  such  as  the  tumour-sequencing  project,^^  our 
knowledge  of  the  mutational  spectrum  of  lung  adeno¬ 
carcinoma  has  substantially  increased.  New  mutated 
oncogenes  and  tumour  suppressor  genes  have  been 
identified  in  lung  adenocarcinoma  and  along  with 
previously  characterized  mutated  genes  are  outlined 
in  Table  1  and  have  been  reviewed  in  detail  else- 
where.3®’3i  It  is  important  to  note  that  many,  if  not 


Table  1  Mutations  in  lung  adenocarcinoma 


— 

Mutation  rate  (%) 

Oncogenes 

KRAS 

15-30 

EGFR 

5-40 

ALK  (fusion) 

5-15 

MET 

14 

KDR 

5 

EPHA3 

5 

MAP2K1 

5 

HER2/ERBB2 

2-4 

FGFR4 

4 

PDGFRA 

4 

NTRK1 

4 

NTRK3 

4 

EPHA5 

4 

ERBB4 

4 

LTK 

3 

PAK3 

3 

ERBB3 

2 

FGFR1 

2 

FGFR2 

2 

NBAS 

2 

PIK3CA 

2 

BRAF 

2 

AKT1 

1 

Tumour-suppressor  genes 

TP53 

50-70 

STK1 

20-30 

LRP1B 

9 

NF1 

7 

ATM 

7 

APC 

6 

PTPRD 

5 

CDKN2A 

5 

RBI 

4 

PTEN 

2 
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most,  of  these  mutations  are  not  mutually  exclusive 
of  other  driver  mutations  and  events  such  as  EGFR 
mutations  and  EML4-ALK  fusions.  For  example, 
PIK3CA  mutations  were  always  found  together  with 
EGER  mutations  in  never- smoker  lung  adenocarcino¬ 
mas.^^  It  is  also  worthwhile  to  mention  that  some  of 
the  outlined  mutations  have  been  detected  in  both 
lung  adenocarcinomas  and  SCC  (e.g.  PIK3GA  and 
MET)  or  only  in  the  former  subtype  of  NSCLC  (e.g. 
MEKl ,  HER2/NEU  and  BRAE)  It  is  unknown  for  most 
of  mutations  occurring  in  lung  adenocarcinomas  and 
recently  identified  by  exon- directing  sequencing  in 
the  tumour- sequencing  project,  whether  they  also 
occur  in  lung  SCC.  The  discovery  of  new  oncogene 
and  tumour  suppressor  mutations  in  lung  adenocar¬ 
cinoma  occurring  in  mutually  exclusive  and  inclusive 
cell  signalling  pathways  expands  the  range  of  possible 
target-specific  and  even  combinatorial  personalized 
therapeutic  strategies  for  this  disease. 


Copy-number  alterations 

Gene  dosage  variations  occur  in  many  pathological 
conditions.  For  example,  in  cancer,  deletions  and 
copy-number  increases  modulate  the  expression  of 
tumour- suppressor  genes  and  oncogenes,  therefore 
contributing  to  tumourigenesis.  Characterization  of 
these  DNA  copy- number  changes  is  vital  for  both  the 
basic  understanding  of  cancer  and  its  diagnosis. 
Copy-number  alterations  are  routinely  assessed  in 
laboratories  by  fluorescent  in  situ  hybridization 
(FISH)  techniques  as  well  as  genomic  polymerase 
chain  reaction  (PCR),  including  quantitative  PCR 
approaches.  However,  these  approaches  are  labour- 
intensive  and  would  hamper  the  discovery  and  com¬ 
plete  understanding  of  the  genome  of  tumours  in 
large-scale  studies.  High-throughput  and  genome¬ 
wide  analysis  of  DNA  copy-number  alterations  was 
made  possible  by  comparative  genomic  hybridization 
(CGH)  approaches,  which  utilize  differentially 
labelled  test  and  reference  genomic  DNA  that  are 
co-hybridized  to  normal  metaphase  chromosomes.^® 
CGH,  however,  exhibits  limited  mapping  resolution 
even  when  compared  with  lower  throughput  higher 
resolution  techniques,  such  as  FISH.®®  Subsequently, 
high-resolution  genome-wide  analysis  was  success¬ 
fully  performed  using  cDNA  micro  array-based  CGH 
and  SNP  arrays  coupled  with  statistical  methods  to 
assess  both  the  amplitude  and  the  frequency  of  copy- 
number  changes  at  each  position  in  the  genome.®® 
Genome-wide  alterations  in  human  lung  adenocar¬ 
cinoma  tumours  stemming  from  several  major 
studies  have  assuredly  increased  our  understanding 
of  the  molecular  pathogenesis  of  this  major  malig¬ 
nancy.®®®®®^  Earlier  chromosomal  CGH  studies  have 
revealed  in  NSCLC  recurrent  gains  at  lq31,  3q25-27, 
5pl3-14  and  8q23-24,  and  deletions  at  3p21,  8p22, 
9p21-22,  13q22  and  17pl2-13.®®“®®  Moreover,  these 
early  studies  already  highlighted  genomic  differences 
and  similarities  between  lung  adenocarcinomas  and 
SCC;  most  prominent  of  which  were  gains  in  3q 
mainly  by  lung  SCC.®®“®®  For  example,  Petersen  et  ah 
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found  genomic  aberrations  that  characterize  lung 
adenocarcinomas  from  SCC,  namely  gain  of  lq23, 
and  the  deletion  at  9q22  were  significantly  associated 
with  adenocarcinomas,  whereas  the  loss  of  chromo¬ 
somal  band  2q36-37  and  gain  of  3q  were  strongly 
associated  with  SCC.®®  Bjorkqvist  et  ah  demonstrated 
that  94%  (15/16)  of  lung  SCC  analysed  had  a  gain  in 
3q,  whereas  only  24%  (4/17)  of  the  adenocarcinoma 
samples  exhibited  a  gain  in  3q,  and  high-level  ampli¬ 
fications  in  3q  were  only  detected  in  SCC.®®  In  addi¬ 
tion,  Luk  etal.  demonstrated  that  gains  at  lq22-32.2, 
15q,  20q  and  losses  at  6q,  13q  and  18q  were  more 
prevalent  in  lung  adenocarcinomas,  whereas  SCC, 
as  shown  in  earlier  studies,  exhibited  gains /amplifi¬ 
cations  at  3q.®^  Moreover,  Pei  etal.  showed  that 
besides  prevalent  gain  of  3q  in  lung  SCC,  gain  of 
20pl3  and  loss  of  4q  also  were  significantly  higher  in 
SCC,  whereas  gain  of  6p  was  more  common  in  adeno¬ 
carcinomas.®®  Massion  et  al.  utilized  higher  resolution 
analysis  by  array  CGH  to  study  copy-number  alter¬ 
ations  of  known  loci  and  found  that  the  most  distinct 
genomic  aberration  between  both  NSCLC  subtypes 
was  gain  of  3q22-26  and  loss  of  3p  by  lung  SCC.®^ 
Moreover,  and  in  the  same  study,  PIK3GA  oncogene 
was  found  to  be  a  member  of  the  chromosome-3q 
amplicon  with  higher  copy  number  and  expression  in 
SCC  but  not  in  adenocarcinomas.®^ 

More  recent  studies  utilized  more  advanced  tech¬ 
nologies  to  query  the  genome  of  lung  adenocarci¬ 
noma  and  associate  specific  gene  modulations  with 
chromosomal  and  loci  gain  or  losses.  In  the  study 
by  Tonon  etal,  high-resolution  cDNA  microarray- 
based  CGH  was  utilized  to  study  the  genomic 
profiles  of  18  lung  adenocarcinomas  and  26  SCC,  as 
well  as  14  NSCLC  cell  lines.®^  The  study  identified 
93  focal  copy- number  alterations  that  mainly 
comprised  previously  un  characterized  recurrent 
high- amplitude  amplifications  and  homozygous 
deletions.®^  Besides  confirming  previous  findings  by 
chromosomal  CGH  and  highlighting  known  gains 
(e.g.  Iq31  and  3q25-27)  and  known  deletions  (e.g. 
3p,  8p22  and  13q22),  the  study  by  Tonon  etal.  was 
able  to  map  specific  genes  to  focal  copy-number 
alterations  including  GDKN2A  and  RBI  tumour- 
suppressor  genes  and  EGER,  and  KRAS  oncogenes.®^ 
However,  when  comparing  both  adenocarcinomas 
and  lung  SCC,  the  study  found  that  the  only  notable 
genomic  difference  between  both  NSCLC  histologi¬ 
cal  subtypes  was  the  well- characterized  gain  of 
3q26-29  in  SCC  that  included  TP63,  well  known  for 
its  role  in  squamous  differentiation,  and  concluded 
that  similar  oncogene  and  tumour-suppressor  gene 
aberrations  drive  lung  adenocarcinoma  and  SCC 
pathogenesis.®^  The  study  by  Tonon  etal.  was  a 
major  step  in  understanding  the  genomic  profiles  of 
NSCLC  tumours  despite  the  small  number  of  lung 
adenocarcinomas  and  SCC  analysed. 

Later  on,  Weir  et  al.  studied  the  genomic  profiles  of 
a  large  collection  of  primary  lung  adenocarcinomas 
{n  =  371)  by  high-density  SNP  arrays  using  238  000 
probe  sets.®®  The  report  by  Weir  et  al.  was  a  milestone 
in  understanding  the  lung  adenocarcinoma  genome, 
as  it  unravelled  previously  uncharacterized  amplified 
genes  and  loci  that  otherwise  may  have  not  been 
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identified  using  a  small  number  of  primary  tumours. 
The  study  identified  39  large-scale  chromosomal  arm 
gain  or  loss,  26  of  which  were  significantly  recurrent 
across  many  lung  adenocarcinomas.  Importantly,  the 
large-scale  study  by  Weir  et  al.  identified  31  recurrent 
focal  events  that  included  24  amplifications  and 
7  homozygous  deletions.  Using  dense  SNP  arrays 
coupled  with  statistical  methods  (genomic  identifica¬ 
tion  of  significant  targets  in  cancer),  the  group  was 
able  to  associate  specific  genes  to  the  focal  events  and 
rank  significance  of  events  based  on  both  the  ampli¬ 
tude  and  frequency  of  copy- number  change,^®  similar 
to  what  was  performed  by  Tonon  et  al.^^  to  identify 
minimal  common  regions  of  copy- number  alter¬ 
ations.  The  most  significant  focal  regions  of  amplifi¬ 
cation  included  known  oncogenes  such  as  MDM2 
(12ql5),  MYC  (8q24),  EGFR  (7pll),  CDK4  (12ql4), 
KRAS  (12pl2),  CCNEl  (19ql2),  ERBB2  (17ql2), 
CCNDl  (llql3)  and  TERT  (5pl5).^^  It  is  important 
to  note  that  three  of  these  oncogenes,  EGER,  KRAS 
and  ERBB2,  are  mutated  in  lung  adenocarcinoma, 
as  discussed  before,  suggesting  that  amplification 
and  mutation  of  these  oncogenes  may  cooperate  sys¬ 
tematically  in  lung  adenocarcinoma  pathogenesis. 
The  most  significant  focal  regions  of  deletions  also 
included  known  tumour-suppressor  genes  such  as 
GDKN2A  (9p21)  and  PTEAr(10q23)."®  Although  5p  was 
previously  shown  to  be  gained  in  lung  adenocarcino¬ 
mas,  the  identity  of  genes  involved  in  this  gain  was 
unknown  prior  to  the  study  by  Weir  et  al.  The  appli¬ 
cation  of  more  advanced  technologies  to  characterize 
the  genomic  profile  of  lung  adenocarcinomas 
enabled  the  group  to  highlight  previously  unknown 
associations  between  canonical  cancer-associated 
genes  and  known  loci  copy-number  alterations,  as 
well  as  to  identify  potentially  new  oncogenes.  For 
example,  10  genes,  including  TERT,  were  found  in  the 
study  to  be  included  in  the  5pl5  region.^®  Further¬ 
more,  the  study  highlighted  previously  uncharacter¬ 
ized  amplification  of  TITE-1/NKX2-1  (14ql3.3)  in  lung 
adenocarcinomas  and  demonstrated  the  oncogenic 
role  of  this  lineage -specific  transcriptional  factor 
in  lung  cancer  cells  evidenced  by  the  effect  of  RNA 
interference-mediated  knockdown  of  its  expression 
on  anchorage-independent  growth  of  lung  adenocar¬ 
cinoma  cell  lines  with  amplification  of  this  gene.^®  The 
amplification/copy-number  gain  of  TITE-1  in  lung 
adenocarcinoma  was  later  confirmed  in  different 
studies  including  that  by  Kwei  et  al.  using  array 
CGH.^^  However,  it  is  important  to  mention  that 
although  TITE-1  amplification  is  generally  pre¬ 
valent  in  lung  adenocarcinoma  and  is  thought 
to  function  as  a  lineage- specific  oncogene  in  this 
subtype  of  NSCLC,  it  has  also  shown  by  FISH  analysis 
in  lung  see,  which  will  be  discussed  later  in  this 
review. 

As  mentioned  before,  numerous  earlier  studies 
have  demonstrated  that  gain  of  3q  is  a  genomic 
feature  of  lung  SCC.  Similar  to  the  aforementioned 
study  by  Weir  et  al.,  Bass  and  colleagues  utilized 
high-density  Affymetrix  SNP  arrays  to  analyse  40 
oesophageal  and  47  lung  SCC,  which  confirmed  that 
gain  of  3q26  was  the  main  focal  amplification  event 
in  lung  and  oesophageal  SCC.^^  Importantly,  the 
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same  study  revealed  the  presence  of  the  ESC  factor 
in  this  region,  which  was  later  on  confirmed  to  be 
amplified  in  SCC  and  not  in  lung  adenocarcinomas, 
and  promoted  the  survival  of  cell  lines  of  the  former 
NSCLC  subtype  harbouring  amplification  of  this 
transcriptional  factor  and  further  suggesting  that 
lung  adenocarcinomas  are  genetically  different  from 
SCC. 

More  recently,  high-resolution  array  CGH  was  per¬ 
formed  on  never- smoker  lung  adenocarcinomas 
{n  =  60)  with  known  mutation  status  of  EGER.^^  This 
study  identified  14  new  minimal  common  regions  of 
gain  or  loss  and  confirmed  previously  known  copy- 
number  alterations  such  as  those  involving  TERT 
(5p),  TITE-1  (14ql3),  EGER  (7p)  and  GDKN2B  (9p). 
Notably,  the  study  revealed  new  genomic  aberra¬ 
tions,  namely  the  16pll.2  region  harbouring  the  EUS 
oncogene  that  functions  in  transcriptional  splicing 
and  DNA  repair.®^  Gain  of  16pll.2  was  evident  in 
greater  than  20%  of  the  never-smoker  lung  adeno¬ 
carcinomas  analysed  and  mRNA  levels  of  EUS  corre¬ 
lated  with  copy  gain  of  16p,  as  they  were  higher  in 
tumours  with  gain  of  this  region  compared  with 
tumours  that  did  not  exhibit  16p  copy  gain.  Impor¬ 
tantly,  the  study  by  Job  et  al.  revealed  genomic  copy- 
number  alterations  that  were  highly  associated  with 
presence  of  EGER  mutation,  an  oncogenic  driver  of 
never-smoker  lung  adenocarcinoma  pathogenesis.®^ 
Gains  of  7p  were  significantly  associated  with 
presence  of  EGER  mutations  and  included  EGER 
gene,  suggesting,  as  mentioned  before,  that  copy- 
number  alternations  and  mutations  cooperate  at 
the  genomic  level  in  lung  adenocarcinoma  patho¬ 
genesis.  However,  it  cannot  be  ignored  that  EGER 
copy  gain  or  amplification  may  favour  the  detection 
of  EGER  mutations  in  a  heterogeneous  tumour  due 
to  the  mutant  allele- specific  imbalance  phenom¬ 
enon.®®  In  a  more  recent  study  by  Yuan  et  al.  and  also 
using  array  CGH  technology,  gains  in  7p,  including 
the  EGER  gene,  were  common  in  EGER  mutant 
lung  adenocarcinomas  and  predicted  overall  and 
recurrence-free  survival  in  this  disease  population.^® 
More  importantly,  in  contrast  to  EGER  mutations, 
presence  of  genes  (including  EGER)  within  the  7p 
gain  predicted  poorer  response  to  tyrosine  kinase 
inhibitors  targeting  EGER.^^ 


Gene  expression  profiling 

Numerous  studies  have  utilized  micro  array  technol¬ 
ogy  to  analyse  the  global  transcriptome  of  NSCLC  for 
diagnosis  (discussed  later),  molecular  classification, 
response  to  therapy  and  prognosis.  Lor  the  purpose 
of  this  review,  we  will  discuss  several  key  studies  that 
investigated  expression  profiles  of  lung  adenocarci¬ 
nomas  to  further  understand  the  molecular  biology 
of  this  prevalent  lung  malignancy.  Bhattacharjee 
and  colleagues  utilized  arrays  to  study  adenocarcino¬ 
mas  of  lung  origin  [n  =  127),  SCC  [n  =  21),  carcinoids 
{n  =  20),  SCLC  {n  =  6)  and  17  normal  samples.^^  The 
study  found  that  differential  expression  profiles  seg¬ 
regated  the  samples  into  different  clusters  based  on 
histology,  evidenced  by  the  two-dimensional  cluster 
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analysis.  Genes  associated  with  squamous  differen¬ 
tiation  such  as  keratin  and  TP63  were  overexpressed 
in  see,  and  neuroendocrine  markers  were  enriched 
in  the  SCLC  cluster.  Importantly,  the  study  by  Bhat- 
tacharjee  etal.  also  analysed  the  adenocarcinomas 
alone  by  hierarchical  clustering  and  demonstrated 
that  the  adenocarcinomas  were  heterogeneous  in 
molecular  make-up,  being  separated  into  various 
clusters  with  distinct  clinical  outcomes. Similarly, 
Garber  etal.  utilized  cDNA  microarrays  to  study 
expression  profiles  of  41  adenocarcinomas,  16  SCC,  5 
large-cell  carcinomas  and  5  SCLC,  as  well  as  5  normal 
lung  samples.^^  Again,  lung  adenocarcinomas  were 
most  heterogeneous  and  were  divided  into  different 
clusters  that  were  associated  with  clinicopathological 
variables  such  as  tumour  grade. Later,  Hayes  etal. 
found  that  adenocarcinoma  subtypes  identified  by 
the  Bhattacharjee  and  Garber  studies  were  reproduc¬ 
ible  in  additional  microarray  datasets.^^  Several  other 
studies  have  also  demonstrated,  using  microarray 
expression  profiling  technology,  the  heterogeneity  of 
lung  adenocarcinomas  and  their  distinction  from 
other  lung  cancer  subtypes.  As  reviewed  by  Yatabe, 
global  gene  expression  profiling  was  able  to  subdi¬ 
vide  lung  adenocarcinomas  into  various  clusters  that 
correlated  with  EGFR  mutation  status,  prognosis, 
expression  of  lung  peripheral  airway  markers  such  as 
surfactant  proteins  (SP)  and  CCid,  as  well  as  enrich¬ 
ment  of  the  BAG  subtype.^^ 


Lung  adenocarcinoma  preneoplasia 

From  biological  and  histopathological  perspectives, 
NSCLC  is  a  complex  malignancy  that  develops 
through  multiple  preneoplastic  pathways.  Lung 
adenocarcinoma,  a  major  subtype  of  NSCLC,  has 
been  increasing  in  incidence  globally  in  both 
smokers  and  non-smokers^^  with  a  concurrent 
decrease  in  SCC  frequency.  It  has  been  postulated 
that  the  increasing  incidence  of  lung  adenocarcino¬ 
mas  compared  with  SCC  is  in  part  due  to  the  change 
in  the  type  of  cigarettes  used  (lower  nicotine  and  tar) 
and  smoking  habits  and  behaviour.^  Anatomical  dif¬ 
ferences  in  the  location  of  diagnosed  lung  adenocar¬ 
cinomas  and  SCC  strongly  suggest  that  both  NSCLC 
subtypes  develop  through  different  histopathological 
and  molecular  pathways  and  have  different  cells  of 
origin;  however,  the  specific  respiratory  epithelial 
cell  type  from  which  each  lung  cancer  type  develops 
has  not  been  established  with  certainty.^  Lung  SCC 
is  typically  centrally  located  in  the  lung  and  is 
thought  to  arise  from  the  major  bronchi.  In  contrast, 
lung  adenocarcinomas  that  are  usually  peripherally 
located  are  believed  to  arise  from  small  bronchi, 
bronchioles  or  alveoli  of  the  distant  airways  of  the 
lung.  The  sequence  of  histopathological  changes  in 
bronchial  epithelia  that  precede  the  development  of 
lung  SCC  has  been  characterized.®’^^  However,  the 
sequential  preneoplastic  changes,  as  well  as  the  cor¬ 
responding  molecular  abnormalities,  leading  to  the 
development  of  lung  adenocarcinomas  are  poorly 
documented. 
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Histopathological  development  of 
lung  adenocarcinoma 

Clara  cells  and  the  type  II  pneumocytes  are  believed 
to  be  the  progenitor  cells  of  the  peripheral  airways, 
and  peripherally  arising  adenocarcinomas  often 
express  markers  of  these  cell  types.^^’^^  Atypical 
adenomatous  hyperplasias  (AAH)  are  considered  to 
be  a  precursor  lesion  for  peripheral  lung  adenocarci¬ 
nomas.®’^  However,  and  until  now,  AAH  is  the  only 
sequence  of  morphological  change  identified  so  far 
for  the  development  of  invasive  lung  adenocarcino¬ 
mas,  and  there  is  consensus  that  the  pathogenesis  of 
many  adenocarcinomas  is  largely  unknown.  The  pos¬ 
tulated  progression  of  AAH  to  adenocarcinomas  in 
situ,  which  is  characterized  by  the  growth  of  neoplas¬ 
tic  cells  along  pre-existing  alveolar  structures  without 
invasion,  is  supported  by  molecular  studies.^®  Distinc¬ 
tion  between  highly  atypical  AAH  and  what  was 
known  as  BAG  is  sometimes  difficult.  Therefore,  and 
as  mentioned  before,  the  ERS,  lASLC  and  ATS  spon¬ 
sored  a  new  classification  of  lung  adenocarcinoma 
that  presented  several  modifications  to  the  WHO  2004 
criteria  for  diagnosis  of  resected  adenocarcinoma 
specimens.  The  term  BAG  was  suggested  to  be  discon¬ 
tinued  and  replaced  with  AIS  and  MIA  used  for  small 
adenocarcinomas  with  either  pure  lepidic  growth  or 
predominant  lepidic  growth  with  less  than  5  mm 
invasion,  respectively.  Importantly,  the  clinical  fea¬ 
tures  of  both  adenocarcinoma  progression  types  are 
unique  as  patients  with  AIS  or  MIA  have  a  100%  5 -year 
survival  rate  after  respective  surgery.^® 

The  differentiation  phenotype  derived  from  immu- 
nohistochemical  and  ultrastructural  features  indi¬ 
cates  that  AAH  originate  from  the  progenitor  cells  of 
the  peripheral  airways. Surfactant  apoprotein 
and  Clara  cell-specific  10-kDd  protein  are  expressed 
in  almost  all  AAH.  In  addition,  an  increasing  body  of 
evidence  suggests  that  AAH  is  the  precursor  of  at  least 
a  subset  of  adenocarcinomas.  For  example,  AAH  is 
most  frequently  detected  in  lungs  of  patients  bearing 
lung  cancers  (9-20%),  especially  adenocarcinomas 
(as  many  as  40%),  compared  with  lung  SCC  (11%).^®  It 
is  important  to  note  that  AAH  is  detected  more  fre¬ 
quently  in  East  Asian  patients  relative  to  Western 
patients.  In  such  studies,  it  has  been  suggested  that 
AAH  is  involved  in  the  linear  progression  of  cells  of  the 
Terminal  respiratory  unit’  (TRU)  to  AIS  and  subse¬ 
quently  invasive  adenocarcinomas^’^^’^^  due  to  the 
expression  of  common  genes  between  the  TRU  and 
the  AAH,  which  is  discussed  later.  Such  studies  have 
postulated  that  most,  if  not  all,  peripheral  lung 
adenocarcinomas  progress  from  alveoli  through  AAH 
as  a  preneoplastic  lesion.  As  will  be  discussed  further 
later,  we  have  noted  similar  molecular  abnormalities 
(e.g.  EGFR  mutations)  between  adenocarcinomas 
arising  in  never- smokers  and  small  bronchioles 
within  the  localized  and  adjacent  fields  of  the  adeno¬ 
carcinomas,  suggesting  that  lung  adenocarcinomas 
may  arise  from  bronchiolar  epithelium  and  small 
bronchi,  and  not  only  from  alveoli.^^’^®  In  a  recent 
review  by  Yatabe  etal.,  a  nonlinear  progression 
schema  for  lung  adenocarcinomas  was  suggested.^  In 
this  nonlinear  schema,  Yatabe  et  al.  postulated  that 
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lung  adenocarcinomas  of  the  TRU  subtype,  as  named 
by  the  authors,  develop  through  AAH.  On  the  other 
hand,  and  according  to  the  same  nonlinear  progres¬ 
sion  hypothesis,  some  lung  adenocarcinomas  arise 
through  unknown  preneoplastic  precursors  from 
other  cells  besides  the  TRU,  which  we  believe  may  as 
well  be  the  bronchiolar  epithelium 

Molecular  pathogenesis  of  lung 
adenocarcinomas 

Several  molecular  changes  frequently  present  in  lung 
adenocarcinomas  are  also  present  in  AAH  lesions, 
and  they  are  further  evidence  that  AAH  may  represent 
true  preneoplastic  lesions/^  The  most  important 
finding  is  the  presence  of  KRAS  (codon  12)  mutations 
in  as  many  as  39%  of  AAH,  which  are  also  a  relatively 
frequent  alteration  in  lung  adenocarcinomas.®’^® 
Other  molecular  aberrations  that  were  identified  in 
AAH  are  overexpression  of  cyclin  D1  (-70%),  survivin 
(48%)  and  HER2lneu  (7%)  proteins.®  Moreover,  and  as 
mentioned  in  the  review  by  Wistuba  and  Gazdar, 
some  AAH  lesions  were  found  to  exhibit  loss  of 
heterozygosity  (LOH)  in  chromosomes  3p  (18%),  9p 
^plffNK4a^  13%),  9q  (53%),  17q  and  17p  (TP53,  6%).®  It  is 
noteworthy  that  most  if  not  all  of  the  aforementioned 
changes  identified  in  AAH  lesions  are  also  frequently 
detected  in  lung  adenocarcinomas.  Later,  AAH  lesions 
were  shown  to  exhibit  LOH  of  tuberous  sclerosis 
complex  (TSC) -associated  regions,  activation  of 
telomerase,  loss  of  LKBl,  overexpression  of  DICER,  a 
key  effector  protein  for  small  interfering  RNA  and 
miRNA  function,  and  DNA  methylation  of  CDKN2A 
and  PTPRN2A^^’^^  It  is  important  to  note  that  several 
studies  have  attempted  to  globally  comprehend  dif¬ 
ferential  gene  expression  patterns  and  copy-number 
alterations  between  low-grade  lesions  (e.g.  precursor 
lesions)  or  in  situ  adenocarcinomas  and  invasive 
tumours  and  found  that  amplification  of  the  EGFR 
oncogene  was  the  predominant  differential  molecular 
feature  between  the  two  different  adenocarcinoma 
grade  classes  and  occurred  after  mutations  in  the 
gene.^  Importantly,  as  will  be  discussed  in  the  next 
section  of  this  review,  EGER  mutations  also  preceded 
changes  in  copy  number  of  the  gene  when  studying 
histologically  normal  bronchiolar  epithelia.^® 

KRAS  and  EGFR  mutations  in  lung 
adenocarcinoma  pathogenesis 

Although  there  is  only  one  sequence  of  morphological 
change  characterized  so  far  for  the  development  of 
invasive  lung  adenocarcinomas,  namely  AAH,  a  large 
body  of  evidence  suggests  that  at  least  two  molecular 
pathways  are  involved,  the  KRAS  and  EGER  pathways 
in  smoker  and  never- smoker  adenocarcinoma 
subpopulations,  respectively. Mutations  in 
EGER,  in  particular,  in-frame  deletions  of  exon  19  and 
L858R  and  L861Q  of  exon  21,  are  strongly  associated 
with  never-smoking  status,  female  gender  and  East 
Asian  ethnicity,  as  well  as  predict  favourable  response 
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to  EGER  tyrosine  kinase  inhibitor. On  the 
other  hand,  mutations  in  KRAS,  the  most  frequently 
mutated  oncogene  in  lung  adenocarcinoma,  based 
on  recent  findings  of  the  tumour-sequencing  project, 
are  strongly  associated  with  development  of  adeno¬ 
carcinomas  linked  to  tobacco  consumption.^’^®’^^’^®’^^’^® 

It  has  been  suggested  that  the  vast  majority  of  AAH 
precursor  lesions  and  adenocarcinomas  in  situ  are 
associated  with  the  TRU  adenocarcinoma  subtype 
that  were  found  to  express  high  levels  of  TITE-1  and 
SP,  leading  to  the  conclusion  that  such  adenocarcino¬ 
mas  are  of  the  same  lineage  as  terminal  airway  epi¬ 
thelial  cells.  In  addition,  it  has  been  postulated  that 
EGER  mutations  are  predominant  in  or  specific  to 
peripheral  lung  adenocarcinomas  of  the  TRU 
subtype,  which  were  suggested  to  arise  from  AAH 
lesions,^^’^^’®®  as  90  of  97  EGER  mutant  adenocarcino¬ 
mas  were  positive  for  TITE-1,  and  91  of  the  97 
tumours  were  of  the  TRU  subtype.®®  In  addition,  the 
hypothesis  put  forward  that  EGER  mutations  are 
associated  with  or  specific  to  the  TRU  subtype  of  lung 
adenocarcinomas  is  also  in  part  due  to  the  observa¬ 
tion  that  the  frequency  of  EGER  and  KRAS  mutations 
among  AAH  lesions,  adenocarcinomas  in  situ  and 
invasive  adenocarcinomas  is  significantly  different. 

It  was  determined  that  whereas  KRAS  mutations 
decreased  along  adenocarcinoma  progression,  from 
33%  in  AAH  to  8%  in  adenocarcinomas,  EGER  muta¬ 
tions  were  evenly  distributed  suggesting  that 
KRAS-mutaXed  AAH  lesions  rarely  progress  to  adeno¬ 
carcinomas.  It  is  also  important  to  mention,  and  as 
reviewed  by  Yatabe,  that  several  studies  performed 
gene  expression  profiling  of  lung  adenocarcinomas 
and  other  histological  subtypes  of  lung  cancer  and 
found  that  lung  adenocarcinomas  were  heteroge¬ 
neous  and  divided  into  different  clusters.^^  Clusters 
with  expression  of  GGIO  and  features  of  alveolar  sig¬ 
nature  such  as  TITE-1  exhibited  significantly  better 
survival  compared  with  adenocarcinomas  in  other 
clusters  and  comprised  a  higher  frequency  of  EGER 
mutations. 

Mutations  in  the  tyro  sine -kinase  domain  of  EGER 
mutations  were  shown  to  be  involved  in  the  early 
pathogenesis  of  lung  cancer,  being  identified  in  histo¬ 
logically  normal  epithelium  of  small  bronchi  and 
bronchioles  adjacent  to  EGER  mutant  adenocarcino¬ 
mas^^  (discussed  further  in  the  next  section  of  the 
review).  EGER  mutations  were  detected  in  normal¬ 
appearing  peripheral  respiratory  epithelium  in  43% 
adenocarcinoma  patients,^^  but  not  in  patients 
without  mutation  in  the  tumour. These  findings  may 
signify  different  cell  types  comprising  the  examined 
epithelia,  which  could  represent  sites  of  the  cells  of 
origin  for  EGER  mutant  adenocarcinomas  of  the  lung. 
Although  the  cell  type  having  those  mutations  is 
unknown,  our  group  has  hypothesized  that  stem  or 
progenitor  cells  of  the  bronchial  and  bronchiolar  epi¬ 
thelium  bear  such  mutations.  It  is  also  noteworthy  that 
EGER  mutations  were  identified  in  only  3  of  40  AAH 
lesions  examined®®  ®^  and  were  shown  to  be  absent®®  or 
relatively  infrequent  in  what  was  previously  known  as 
BAG  of  the  lung.®^  These  earlier  observations  support 
the  argument  that  abnormalities  of  EGER  are  not  only 
relevant  to  the  pathogenesis  of  alveolar-type  lung 
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neoplasia  but  also  may  play  drive  peripheral  lung 
adenocarcinoma  from  bronchiolar  epithelium  cells 
that  are  distinct  from  terminal  respiratory  and  alveolar 
cells. The  different  findings  of  EGFR  mutation 
rates  in  AAH  lesions  may  as  well  reflect  the  ethnicity 
(Asian  vs  Western)  of  the  patients  from  which  the 
lesions  were  isolated,  as  well  as  the  standard  practice 
of  detection  of  small  lesions  such  as  AAH. 


Field  cancerization 

Although  the  majority  of  lung  cancer  patients  are 
current  or  former  smokers  (approximately  85%),  a 
relatively  small  fraction  of  smokers  (approximately 
15%)  develop  primary  lung  tumours.  Patients  with 
early  stage  NSCLC,  relative  to  other  early  stage  malig¬ 
nancies,  frequently  exhibit  recurrence  or  second 
primary  tumour  development  after  definitive  treat¬ 
ment  by  surgery  and  removal  of  the  original  lung 
primary  tumour.  There  is  a  large  body  of  evidence  that 
heavy  smokers  and  patients  who  have  survived  an 
upper  aerodigestive  cancer  comprise  a  high-risk 
population  that  may  be  targeted  for  early  detection 
and  chemoprevention  efforts.®  Although  the  risk  of 
developing  lung  cancer  decreases  after  smoking 
cessation,  the  risk  never  returns  to  baseline.  Preneo- 
plastic  changes,  namely  dysplastic  histological  abnor¬ 
malities,  have  been  utilized  as  surrogate  endpoints  for 
chemopreventive  studies.  However,  it  was  suggested 
that  this  ‘shooting-in-the-dark’  approach  may  explain 
the  reasons  behind  the  general  failures  of  clinical 
chemoprevention  studies.®  It  is  also  important  to  note 
that  we  are  unable  to  predict  which  lifetime  never- 
smokers  or  definitively  treated  never- smoker  early- 
stage  lung  cancer  patients  will  develop  lung  tumours 
or  relapse.  Therefore,  novel  approaches  to  identify  the 
best  population  to  be  targeted  for  early  detection  and 
chemoprevention  should  be  devised,  and  risk  factors 
for  lung  cancer  development  or  relapse  need  to  be 
better  defined.  For  these  important  purposes,  a  better 
understanding  of  the  biology  and  molecular  origins  of 
lung  cancer,  for  example,  lung  adenocarcinoma,  is 
warranted.  In  this  section  of  the  review,  we  will 
describe  the  field  cancerization  phenomenon  that 
herein  refers  to  that  occurring  due  to  direct  and  indi¬ 
rect  effects  of  smoking  (field  of  injury)  or  independent 
of  smoking  in  patients  with  and  without  cancer,  with 
emphasis  on  aberrant  molecular  markers  in  histologi¬ 
cal  normal  epithelia  that  can  be  used  to  increase  our 
understanding  of  lung  cancer  pathogenesis. 


Smoking  damaged  epithelium  and  the  lung 
field  cancerization  phenomenon 

Earlier  work  by  Danely  Slaughter  et  al.  in  patients 
with  oral  cancer  and  oral  premalignant  lesions  has 
suggested  that  histologically  normal- appearing  tissue 
adjacent  to  neoplastic  and  preneoplastic  lesions 
display  molecular  abnormalities,  some  of  which  are 
in  common  with  those  in  the  tumours.®®  In  1961,  a 
seminal  report  by  Auerbach  et  al.  suggested  that  ciga¬ 


rette  smoke  induces  extensive  histological  changes  in 
the  bronchial  epithelia  in  the  lungs  of  smokers  and 
that  premalignant  lesions  are  widespread  and  multi¬ 
focal  throughout  the  respiratory  epithelium,  sugges¬ 
tive  of  a  field  effect.®®  This  phenomenon,  coined 
‘field  of  cancerization’,  was  later  shown  to  be  evident 
in  various  epithelial  cell  malignancies  including 
lung  cancer.  Some  degree  of  inflammation  and 
inflammatory-related  damage  is  almost  invariably 
present  in  the  central  and  peripheral  airways  of 
smokers  and  may  precede  the  development  of  lung 
cancer.®^  Thus,  the  field  of  cancerization  may  be 
explained  by  both  direct  effect  of  tobacco  carcinogens 
and  initiation  of  inflammatory  response.  In  this 
context,  different  theories  for  the  origin  of  the  field  of 
cancerization  or  smoking-related  field  of  injury  have 
been  put  forward  and  will  not  be  discussed  here,  as 
they  have  been  nicely  and  extensively  reviewed  else¬ 
where  by  Steiling  et  al.^^ 

Several  studies  focusing  on  the  respiratory  epithe¬ 
lium  of  lung  cancer  patients  and  smokers  have  dem¬ 
onstrated  that  multiple  altered  foci  of  bronchial 
epithelium  are  present  throughout  the  airway.®^  ®®’®®  A 
detailed  analysis  of  histologically  normal  epithelium, 
and  premalignant  and  malignant  epithelia  from  lung 
see  patients  indicated  that  multiple,  sequentially 
occurring  allele- specific  chromosomal  deletions  of 
LOH  begin  in  dispersed  clonally  independent  foci 
very  early  in  the  multi-stage  pathogenesis  of  this 
smoking-related  lung  malignancy.®^  ®®  Notably,  31%  of 
histologically  normal  epithelium  and  42%  of  mildly 
abnormal  (hyperplasia/metaplasia)  specimens  had 
clones  of  cells  with  allelic  loss  at  one  or  more  regions 
examined.  Moreover,  these  molecular  aberrations 
were  also  found  in  carcinomas  in  situ  and  SCC,  and  at 
a  more  advanced  level.®^  Molecular  changes  involving 
LOH  of  chromosomal  regions  3p  {DDUT  and  FHIT 
genes),  9p  iCDKN2A),  genomic  instability  (increased 
microsatellite  repeats)  and  pl6  methylation  have  all 
shown  to  commence  in  histologically  normal  or 
slightly  abnormal  tissue  in  SCC  patients  and  in  the 
sequence  of  pathogenesis  of  the  disease.®  As  men¬ 
tioned  before,  KRAS  is  the  most  mutated  oncogene  in 
lung  adenocarcinomas.®^  Almost  15  years  ago.  Nelson 
et  al.  demonstrated  that  KRAS  mutations  are  found  in 
histologically  normal  lung  tissue  adjacent  to  lung 
tumours.®®  As  will  be  discussed  later,  mutations  in 
EGFR  were  also  found  in  adjacent  to  tumour  histo¬ 
logically  normal  epithelium. ^^’^®  Similar  epigenetic 
and  gene  methylation  patterns  between  tumours 
and  adjacent  histologically  normal  epithelia  were 
described.  An  important  study  by  Belinsky  et  al. 
reported  aberrant  promoter  methylation  of  pi 6, 
which  was  described  to  be  commonly  methylated  in 
lung  tumours,®^  in  at  least  one  bronchial  epithelial  site 
from  44%  of  lung  cancer  patients  examined.®®  More¬ 
over,  pl6  and  death-associated  protein  kinase  (DAPK) 
promoter  methylation  was  observed  frequently  in 
bronchial  epithelium  from  current  and  former 
smoker  but  not  from  never- smoker  lung  cancer 
patients  and  persisted  after  smoking  cessation. 
Notably,  94%  of  lung  tumours  exhibited  a  concordant 
pattern  of  pl6  methylation  with  that  in  at  least  one 
bronchial  epithelial  site.®® 


©  201 1  The  Authors 

Respirology  ©  2011  Asian  Pacific  Society  of  Respirology 


Respirology  (2012)  17,  50-65 


58 


H  Kadara  et  al. 


The  aforementioned  molecular  abnormalities  were 
detected  in  histologically  normal  epithelia  adjacent  to 
archival  surgically  resected  tumours  from  primary 
lung  cancer  patients.  LOH  and  micro  satellite  alter¬ 
ations  in  multiple  foci  were  also  detected  in  distal 
histological  normal  bronchial  epithelia  of  smokers 
without  cancer. Moreover,  and  importantly,  these 
molecular  abnormalities  were  detected  in  bronchial 
epithelia  of  cancer-free  former  smokers  that 
appeared  to  have  persisted  for  many  years  after 
smoking  cessation.  In  addition,  LOH  was  detected  in 
DNA  obtained  from  bronchial  brushings  of  normal 
and  abnormal  lungs  from  patients  undergoing  diag¬ 
nostic  bronchoscopy  and  was  detected  in  cells  from 
the  ipsilateral  and  contralateral  lung.^^  Mutations  in 
TP53  were  also  described  to  occur  in  bronchial  epi¬ 
thelia  of  cancer-free  smokers  in  a  widely  dispersed 
manner.^®  Similar  evidence  also  exists  for  promoter 
methylation  and  epigenetic  changes  in  smoking- 
damaged  lung  epithelium  of  cancer-free  patients. 
Methylation  of  various  genes,  including  retinoic  acid 
receptor  2  beta  iRAR-b2),  H-cadheririy  APC,  pl6  and 
RASSFFIA  was  described  in  bronchial  epithelial  cells 
of  heavy  smokers.®^  Moreover,  methylation  of  pl6, 
GSTPl  and  DAPK was  reported  to  be  evident  in  bron¬ 
chial  brushings  of  one  third  of  the  cancer-free 
smokers  examined.^®  In  the  same  study  by  Belinsky 
etal,  as  mentioned  before,  methylation  of  pl6  was 
detected  in  epithelia  of  cancer- free  smokers. A  more 
detailed  list  of  aberrant  gene  promoter  methylation  in 
lung  cancer  patients  and  cancer-free  smokers  is 
nicely  summarized  and  explained  in  the  review  by 
Heller  et 

Gene  expression  profiling  of  the  lung 
field  cancerization 

High-throughput  microarray  profiling  was  used  by 
several  groups  to  study  the  transcriptome  of  lung 
airways.  Hackett  et  al.  utilized  microarrays  to  study 
the  expression  of  44  anti- oxidant-related  genes 
using  bronchial  brushings  from  cancer- free  current 
smokers  and  never- smokers,  and  found  significant 
upregulation  of  16  of  the  antioxidant  genes  in  the 
airways  of  smokers  compared  with  non-smokers. 
Later,  Spira  et  al.  described  global  alterations  in  gene 
expression  between  normal- appearing  bronchial  epi¬ 
thelium  of  healthy  cancer-free  smokers  and  that  of 
non-smokers. In  addition,  and  in  the  reports  by 
Spira  etal.  and  Beane  et  al.,  irreversible  changes  in 
expression  in  airways  of  former  smokers  after  years  of 
smoking  cessation  were  described  that  were  thought 
to  underlay  the  increased  risk  former  smokers 
display,  compared  with  never- smokers,  for  develop¬ 
ing  lung  cancer  long  after  they  have  discontinued 
smoking. Schembri  etal.  also  reported  alter¬ 
ations  in  the  expression  of  miRNA  between  large 
airways  of  current  and  never-smokers.^®^  Notably,  an 
80 -gene  signature  was  derived  from  the  transcrip¬ 
tome  of  large  airway  epithelial  cells  that  can  distin¬ 
guish  smokers  without  overt  cancer  from  smokers 
with  lung  cancer  and  exhibited  statistically  significant 
utility  characteristics  of  a  lung  cancer  biomarker, 
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despite  originating  from  normal  bronchial  epithe- 
lia.^°^  Moreover,  the  80 -gene  signature,  using  publicly 
available  micro  array  datasets,  was  able  to  distinguish 
lung  tumours  from  corresponding  normal  lung  tis¬ 
sues.  More  recently,  Gustafson  et  al.  derived  a  phos- 
phatidylinositol  3 -kinase  iPI3K)  pathway  activation 
signature  using  recombinant  adenoviruses  to  express 
the  1 10a  subunit  of  PI3K in  primary  human  epithelial 
cells.  The  same  study  then  demonstrated  that  the 
PI3K  pathway  activation  signature  was  elevated  in 
cytologically  normal  bronchial  airways  of  smokers 
with  lung  cancer  and  with  dysplastic  lesions.  Of 
substantial  clinical  importance,  the  study  found 
that  the  signature  was  decreased  in  the  airways 
of  high-risk  smokers  whose  dysplastic  lesions 
regressed  following  treatment  with  the  inhibitor 
myoinositol. 

Micro  array  and  gene  expression  profiling  method¬ 
ologies  were  also  used  to  demonstrate  the  wide 
anatomical  spread  of  the  lung  field  cancerization 
to  epithelial  regions  that  can  be  non-invasively 
sampled  when  devising  approaches  for  early  detec¬ 
tion  of  lung  cancer.  Sridhar  etal.  highlighted 
common  gene  expression  alterations  in  bronchial, 
nasal  and  buccal  epithelia  of  smokers,  in  particular 
of  various  detoxification  genes  that  perpetuate  the 
field  of  cancerization  due  to  tobacco  consump¬ 
tion.  In  addition,  Zhang  etal.  identified  119  genes 
whose  expression  was  affected  by  smoking  similarly 
in  both  bronchial  and  nasal  epithelium,  including 
genes  related  to  detoxification,  oxidative  stress  and 
wound  healing,^^^  and  the  study  by  Boyle  etal.  high¬ 
lighted  significant  similarities  in  expression  changes 
between  smokers  and  never-smokers  in  oral  and 
bronchial  epithelia. 


Lung  adenocarcinoma  field  cancerization 

To  better  understand  the  pathogenesis  of  EGFR 
mutant  lung  adenocarcinomas,  Tang  and  colleagues 
investigated  the  presence  of  EGFR  mutations  in 
normal  bronchial  and  bronchiolar  epithelium  adja¬ 
cent  to  EGFR  mutant  tumours.  As  mentioned  before, 
EGFR  mutations  were  detected  in  histologically 
normal  peripheral  epithelia  in  44%  of  lung  adenocar¬ 
cinoma  patients  with  mutations  but  none  in  patients 
lacking  mutations  in  the  oncogene. Moreover,  the 
same  study  highlighted  more  frequent  EGFR  muta¬ 
tions  in  normal  epithelium  within  the  tumour  (43%) 
than  in  adjacent  sites  (24%)  suggests  a  localized  field- 
effect  phenomenon  for  this  abnormality  in  the  respi¬ 
ratory  epithelium  of  the  lung.^^  In  addition,  a  higher 
frequency  of  mutations  in  cells  obtained  from  small 
bronchi  (35%)  compared  with  bronchioles  (18%)  was 
detected. More  recently,  EGFR  protein  overexpres¬ 
sion,  similar  to  mutation  of  the  gene,  also  exhibited  a 
localized  field  effect,  as  it  was  more  frequent  in 
normal  bronchial  epithelia  sites  within  tumours  than 
in  sites  adjacent  to  and  distant  from  tumours.^® 
Interestingly,  EGFR  copy- number  alteration  was 
not  evident  in  normal  bronchial  epithelia,  which 
is  in  accordance  with  findings  that  EGFR  copy 
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number  is  relatively  a  late  event  in  pathogenesis  of 
adenocarcinomas/’^® 


Field  cancerization  compartmentalization 

The  low  frequency  of  molecular  abnormalities 
detected  in  the  centrally  located  bronchial  respiratory 
epithelium  in  patients  with  peripheral  lung  adenocar¬ 
cinomas,  compared  with  specimens  from  patients 
with  see  and  SeLe,®®  suggests  the  presence  of  two 
compartments  in  the  lung  with  different  degrees  of 
smoking-related  genetic  damage.  Thus,  smokers  who 
develop  See  have  more  smoking- related  genetic 
damage  in  the  respiratory  epithelium  of  the  central 
airway,  whereas  patients  who  develop  adenocarci¬ 
noma  have  damage  mainly  in  the  peripheral  airways 
(small  bronchus,  bronchioles  and  alveoli).  While 
some  molecular  changes  (e.g.  inflammation  and  sig¬ 
nalling  pathways  activation)  have  been  detected 
throughout  the  lung  airway  and  include  both  com¬ 
partments  (central  and  peripheral  airway),  other 
aberrations  have  been  more  frequently  altered  in 
either  central  (e.g.  LOH,  genetic  instability  evidenced 
by  microsatellite  repeats)  or  peripheral  (e.g.  EGFR 
mutations)  airways. 


Lineage-specific  genes  in  lung  cancer 

The  transformation  of  normal  cells  into  tumourigenic 
counterparts  is  mediated  by  a  complex  array  of  intra¬ 
cellular  signals,  as  well  as  genetic  and  epigenetic 
regulation.  It  has  been  suggested  that  lineage- specific 
genes,  which  play  important  roles  in  normal  develop¬ 
mental  processes  such  as  organogenesis  or  tissue 
homeostasis  and  remain  to  be  expressed  or  become 
amplified  during  an  acquired  pathological  condition, 
are  crucial  for  maintenance  of  the  disease  state.®^’^®® 
Interestingly,  lineage  genes  can  discriminate  different 
subtypes  of  the  same  cancer  that  rise  from  dissimilar 
cells/progenitors,  for  example,  adenocarcinomas 
versus  squamous  tumours,  and  might  offer  new 
insights  into  crucial  and  therapeutically  pliable 
tumour  dependencies.^®®  Various  studies  have  high¬ 
lighted  the  potential  ‘addiction’  of  tumour  cells  to 
aberrant  and  growth-promoting  cell  signalling  medi¬ 
ated  by  lineage-specific  oncogenes,  for  example, 
presence  of  the  BCR-ABL  fusion  oncoprotein  in 
chronic  myelogenous  leukemia, “®  mutations  in  the 
KIT  oncogene  in  gastrointestinal  stromal  tumours, 
amplification  of  the  microphthalmia- associated  tran¬ 
scriptional  factor  {MITF)  in  melanoma^®  and,  more 
recently,  amplification  of  PAX8  in  ovarian  cancer. 
Two  lineage-specific  oncogenes  have  been  character¬ 
ized  in  NSCLC.  Recently,  TITF-1  amplification  and 
protein  expression  were  shown  to  be  prevalent  in 
lung  adenocarcinomas  and  elicit  growth-promoting 
signals  in  this  malignancy.®^  ®®  The  master  ESC  tran¬ 
scriptional  factor  SOX2  was  shown  to  be  a  member  of 
the  3q  locus  (3q26.3)  that  is  specifically  amplified  in 
lung  and  oesophageal  squamous  carcinomas.®®  These 
findings  demonstrate  that  TITF-1  and  SOX2  function 
as  lineage- specific  oncogenes  in  lung  adenocarcino- 
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mas  and  SCC,  respectively,  and  that  targeting  path¬ 
ways  downstream  of  those  two  master  regulators  may 
leverage  new  therapeutic  strategies  independently  for 
each  NSCLC  subtype. 


TITF-1 

TITF-1  is  a  homeo  domain -containing  transactivating 
factor  predominantly  expressed  in  the  terminal  lung 
bronchioles  and  lung  periphery  in  the  developing  and 
adult  mouse. In  addition,  TITF-1  is  crucial  for 
branching  morphogenesis  during  normal  lung 
development^ ^^““®  and  trans activates  the  expression 
of  the  SP,  such  as  SP-A,  -B  and  -C,  which  are  in  turn 
typically  expressed  in  the  Clara  cells  and  are  impor¬ 
tant  for  the  differentiation  of  alveolar  type  II  pneu- 
mocyte  cells  in  the  peripheral  lung.“^ 

Several  studies  have  demonstrated  increased  copy 
number  and  amplification  of  the  14ql3.3  locus  that 
harbours  the  TITF-1  gene  as  well  as  paired  box  tran¬ 
scriptional  factor  family  member  9  iPAX9)  and 
A/XX2.8.®^’®®  It  is  postulated  that  TITF-1  functions  as  a 
lineage -specific  oncogene  in  lung  adenocarcinoma  as 
knockdown  of  TITF-1  expression,  in  cells  with  ampli¬ 
fication  of  the  gene,  by  RNA  interference  results  in 
lung  adenocarcinoma  cell-growth  inhibition  and 
apoptosis  demonstrating  a  lineage -specific  depen¬ 
dency  of  lung  adenocarcinomas  on  r/TF-i.®^®® 
Kendall  et  ah  demonstrated  that  co- amplified  TITF-1, 
PAX9  and  NKX2.8  exhibit  oncogenic  cooperation  and 
cell  prosurvival  and  proliferative  properties.  “®  Over¬ 
expression  of  both  TITF-1  and  NiOS.  8  simultaneously 
in  BEAS-2B  immortalized  human  bronchial  epithelial 
cells  elicited  the  highest  increase  in  cell  colony  growth 
compared  with  single-gene  transfected  cells. “®  More¬ 
over,  pathway  gene  signatures  that  overlap  down¬ 
stream  of  both  TITF-1  and  NKX2.8  defined  lung 
adenocarcinoma  patients  with  most  dismal  prognosis 
compared  with  signatures  downstream  of  either 
transcriptional  factor  alone. “®  However,  recently  in 
KiMS(LSL-G12D/-F);p53(fiox/fiox)  mice,  TITF-1  was 
shown  to  suppress  tumourigenesis  and  limit  meta¬ 
static  potential  in  vivo 

Our  group  and  others  have  demonstrated  that 
TITF-1  copy- number  gain  or  amplification  is  associ¬ 
ated  with  poor  prognosis  in  NSCLC.  In  contrast  to 
the  expected  pro -survival  properties  of  a  cell-lineage 
oncogene  and  the  association  of  TITF-1  copy- number 
gain  and  amplification  with  poor  survival,  TITF-1 
protein  expression  by  immunohisto chemistry  was 
shown  to  be  a  marker  of  favourable  prognosis  in 
NSCLC^®®“^®®  including  early  stage  (stage  I)  lung 
adenocarcinoma.^®®  It  is  worthwhile  to  mention  that 
TITF-1  protein  expression  and  TITF-1  gene  copy 
number  were  found  to  be  associated  with  mutations 
in  the  KRAS  and  EGFR  oncogenes,  respectively.^®®  As 
mutations  in  EGFR  and  KRAS  occur  almost  mutually 
exclusively  in  lung  adenocarcinomas®  and  were  sug¬ 
gested  to  function  in  different  lineages  of  lung  adeno¬ 
carcinomas,^®®  it  is  possible  that  TITF-1  expression 
is  aberrantly  differently  controlled  within  different 
subsets  of  adenocarcinomas.  It  is  also  important 
to  note  that  TITF-1  copy- number  gain  was  also 

Respirology  {20^2)  17,  50-65 


60 


H  Kadara  et  al. 


demonstrated  in  lung  It  is  plausible  that 

TITF-1  copy  gain  may  only  be  a  surrogate  marker  in 
see  of  another  molecular  defect  in  a  gene  nearby  or 
within  the  14ql3.3  amplicon,  for  example,  NKX2.8  or 
PAX9.  The  significance  of  the  infrequent  copy  number 
increase  of  TITF-1  in  lung  See  remains  elusive. 


SOX2 

SOX2  was  suggested  to  play  key  developmental  roles 
in  the  formation  of  the  lung,  trachea  and  oesophagus 
based  on  its  expression  pattern  in  these  tissues 
and  organs. Interestingly,  SOX2  was  shown  to  be 
important  for  the  morphogenesis  of  the  trachea  and 
oesophagus,  and  the  differentiation  of  the  oesoph¬ 
ageal  epithelium. Moreover,  the  timing  of  SOX2 
expression  in  the  foregut  is  tightly  regulated,  as  it  is 
only  expressed  in  the  main  airways  and  non- 
branching  bronchioles  in  the  developing  and  adult 
mouse  Heterozygote  and  homozygote 

transgenic  mice  with  mutant  SOX2  have  substantial 
defects  in  lung  branching  and  morphogenesis  during 
development. Moreover,  SOX2  plays  key  roles  in 
the  maintenance  of  developing  and  adult  tracheal 
cells  evidenced  by  shorter  and  injured  trachea  in  mice 
with  knockout  of  both  alleles  of  the  transcriptional 
factor.  The  numerous  functions  SOX2  elicits  in  the 
differentiation  of  the  conducting  airways  among 
other  roles  are  reviewed  in  more  detail  by  Whitsett 
and  colleagues.  It  is  important  to  note  that  SOX2 
forms  a  core  transcriptional  factor  complex  with 
OCT4  or  OCTl  and  TirNaNog/ NANOG  that  binds  to 
enhancer  sequences  of  various  genes  to  regulate  the 
inner  cell  mass  or  embryoblast  within  the  blastocyst 
cavity  in  embryos. Moreover,  Boyer  and  colleagues 
demonstrated  that  SOX2  along  with  OCT4  and 
NANOG  form  a  core  regulatory  transcriptional  cir¬ 
cuitry,  signified  by  a  SOX2/OGT4/NANOG  expression 
signature,  consisting  of  autoregulatory  and  feed¬ 
forward  loops  for  the  pluripotency  and  self-renewal 
of  ESC.'"" 

As  mentioned  earlier  in  the  review,  various  studies 
have  demonstrated  that  amplification  of  chromo¬ 
somal  region  3q  (3q26.3)  is  almost  specific  to  lung 
Scc.^'’®"’®4.66,67  jYie  studies  by  Bass  etal  and  Hussenet 
and  colleagues  revealed  that  SOX2  is  amplified  in  this 
chromosomal  region  in  lung  SCC  and  squamous 
oesophageal  cancers  and  promotes  survival  of  SCC 
with  amplification  of  this  gene.""'"^  Subsequently, 
increased  SOX2  mRNA  levels  in  lung  SCC  relative  to 
adenocarcinomas  was  further  evidenced  by  effective 
separation  of  both  NSCLC  subtypes  by  the  previously 
characterized  OGT4/SOX2/NANOG  ESC  expression 
signature,'""  following  analysis  of  publicly  available 
NSCLC  microarray  datasets.""  In  addition,  SOX2 
immunohistochemical  protein  expression  was  com¬ 
pletely  absent  in  lung  adenocarcinoma  pathogenesis, 
highly  expressed  in  SCC  development  and  signifi¬ 
cantly  elevated  in  lung  SCC  relative  to  adenocarcino¬ 
mas.""  Interestingly,  Maeir  etal.  later  demonstrated 
that  S0X2  amplification  was  found  in  squamous  car¬ 
cinomas  originating  from  other  tissues  and  organs, 
such  as  those  of  the  cervix,  skin  and  penis.'""  It  is 
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noteworthy  that  SOX2  immunohistochemical  protein 
expression  in  lung  SCC  and  adenocarcinomas  was 
also  observed  by  other  groups  but  in  association 
with  clinicopathological  features  including  patient 
outcome.  Interestingly,  Wilbertz  etal.  reported  the 
association  of  S0X2  expression  with  favourable 
prognosis  in  lung  SCC.'""  On  the  other  hand,  Sholl 
and  colleagues  demonstrated  that  S0X2  immuno¬ 
histochemical  expression  was  an  indicator  of  poor 
prognosis  in  lung  adenocarcinomas.'"''  Despite  the 
equivocal  associations  of  SOA2  with  lung  cancer  prog¬ 
nosis,  various  studies  have  highlighted  tumour- 
promoting  roles  for  this  lineage-specific  oncogene  in 
lung  cancer.""  '""  '"" 

McCaughan  and  colleagues  specifically  analysed  3q 
copy-number  alteration  in  bronchial  dysplasia  of 
varying  grades  and  severity  and  demonstrated  that 
S0X2  amplification  was  present  in  high-grade  bron¬ 
chial  dysplasias  but  not  in  low-grade  lesions  and, 
importantly,  was  associated  with  clinical  progression 
of  high-grade  preinvasive  squamous  lesions.'^"  It  is 
important  to  mention  that  Yuan  et  al.  had  found  rela¬ 
tively  high  S0X2  immunohistochemical  protein 
expression  in  normal  bronchial  epithelia  and  alveolar 
bronchiolarization  structures.""  Congruent  with  the 
study  by  Yuan  et  al.,  the  results  by  McCaughan  and 
colleagues  demonstrated  the  implication  of  SOX2  in 
the  early  pathogenesis  of  lung  SCC.""  '^"  Given  the  high 
SOX2  protein  expression  in  histologically  normal 
bronchial  epithelia,  amplification  of  SOX2  in  high- 
grade  dysplasia  may  exacerbate  signalling  down¬ 
stream  of  this  transcriptional  factor  in  the  course  of 
SCC  development.  It  is  unknown  whether  SOX2  may 
be  amplified  in  normal  bronchial  epithelia,  in  par¬ 
ticular,  those  adjacent  to  lung  SCC  with  increased 
dosage  of  the  gene.  The  findings  outlined  earlier  dem¬ 
onstrate  that  SOX2  is  another  cell-lineage  oncogene 
with  dissimilar  functions  between  SCC  and  lung 
adenocarcinomas. 


FUTURE  DIRECTIONS 

Lung  adenocarcinoma  genomics 

Studies  addressing  genomic  profiles,  including  copy- 
number  alterations  and  mutational  spectrums,  have 
substantially  increased  our  understanding  of  the 
molecular  make-up  and  biology  of  lung  adenocarci¬ 
nomas  demonstrating  that,  genomically,  this  subtype 
of  NSCLC  is  different  from  SCC.  However,  the  hetero¬ 
geneity  within  lung  adenocarcinomas  is  still  poorly 
understood.  Eor  example,  it  is  unknown  whether,  for 
example,  genomic  copy-number  alterations  found  in 
never-smoker  adenocarcinomas  are  unique  to  this 
subtype  or  whether  they  are  also  found  in  smoker 
tumours.  A  large-scale  side-by-side  genomic  analysis 
of  never-smoker  and  smoker  lung  adenocarcinomas 
would  shed  light  on  copy- number  alterations  unique 
to  both  subtypes  of  lung  adenocarcinomas.  Moreover, 
it  is  not  clear  whether  certain  copy- number  alter¬ 
ations  can  be  clinically  exploited  for  targeted  therapy 
of  lung  adenocarcinoma.  An  important  step  in  this 
direction  was  the  demonstration  by  Yuan  et  al.  that 
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lung  adenocarcinomas  with  mutant  EGFR  and  ampli¬ 
fication  of  specific  genes  within  the  7p  region  predict 
poor  response  to  EGFR  targeting  tyrosine-kinase 
inhibitors/^  It  is  tempting  to  speculate  that  an 
orthogonal  study,  largely  encompassing  both  copy- 
number  alterations  and  mutational  spectrum  and 
detecting  focal  amplification  of  oncogenes  and  loss  of 
tumour- suppressor  genes,  would,  for  example,  high¬ 
light  potential  targets  of  therapy  in  EGFR,  KRAS  and 
ALiC wild-type  lung  adenocarcinomas  for  which  there 
is  an  unmet  need  for  therapeutic  strategies. 


Next-generation  sequencing 

Next-generation  sequencing  (NGS)  technology, 
through  whole-genome,  whole-exome  and  whole- 
transcriptome  approaches,  holds  great  promise  for 
providing  invaluable  insights  into  lung  adenocarci¬ 
noma  biology,  diagnosis,  prevention  and  therapy. 
NGS  enables  the  sequencing  of  expressed  genes, 
exons  and  complete  genomes  providing  data  on  levels 
of  expression  with  a  substantially  larger  dynamic 
range  compared  with  array  technology,  sequence 
alterations,  single  nucleotide  variations,  as  well  as 
structural  genomic  aberrations. A  handful  of 
studies  have  successfully  applied  NGS  approaches  to 
sequence  one  or  two  human  lung  tumour  samples 
or  cell  lines  demonstrating  the  feasibility  of  system¬ 
atic,  genome-wide  characterization  of  rearrange¬ 
ments  and  alterations  in  complex  human  cancer 
genomes. NGS  analysis  of  a  significant  number 
of  lung  adenocarcinomas  and/or  NSCLC  with  charac¬ 
terized  mutational  status  of  known  oncogenes  (e.g. 
EGFR  and  KRAS)  undoubtedly  represents  an  im¬ 
portant  next  step  in  furthering  our  comprehension 
of  lung  cancer  biology.  However,  the  application 
of  NGS  technology  in  clinical  decision-maMng  and 
personalized  medicine  is  yet  challenging. 


Field  cancerization  and  lung 
adenocarcinoma  pathogenesis 

Applying  the  same  advanced  high-throughput  meth¬ 
odologies  currently  used  in  studying  established 
tumours  for  the  genetic  analysis  of  lung  adenocarci¬ 
noma  preneoplasia  and  intraepithelial  lesions,  as  well 
as  histologically  normal  adjacent  regions,  is  expected 
to  expand  our  understanding  of  the  biology  of  this 
prevalent  disease.  An  important  step  in  this  direction 
was  a  recent  study  by  Beane  et  al.  in  which  RNA  of 
bronchial  airway  epithelial  cell  brushings  from 
healthy  never- smokers  and  smokers  with  and  without 
lung  cancer  was  analysed  by  RNA  sequencing. The 
study  highlighted  transcripts  whose  expression  was 
either  not  interrogated  by  or  was  not  found  to  be  sig¬ 
nificantly  altered  when  using  micro  arrays  demon¬ 
strating  that  NGS,  like  in  established  lung  tumours, 
has  the  potential  to  provide  new  insights  into  the 
biology  of  the  airway  field  cancerization  associated 
with  smoking  and  lung  cancer. 

Earlier  findings  demonstrated  that  centrally  located 
lung  see  and  peripherally  located  lung  adenocarci¬ 
nomas  elicit  and  perpetuate  differential  effects  on  the 
airway  epithelia.  We  believe  that  these  effects  overlap 
with  those  of  the  response  of  the  host  to  tobacco 
exposure  (reviewed  by  Steiling  et  al}^)  but  may  be 
unique  in  several  aspects.  Changes  in  expression  in 
the  lung  field  of  injury  have  shown  to  be  similar  in  the 
large  and  small  airways,  and  it  is  unknown  whether 
they  are  associated  with  the  development  of  the  par¬ 
ticular  subtype  of  NSCLC.  Addressing  this  question 
may  be  highly  pertinent  because  both  NSCLC  sub- 
types  display  different  genomic  features,  as  previously 
discussed,  and,  therefore,  are  clinically  managed  by 
significantly  dissimilar  treatment  strategies,  let  alone 
differences  among  various  subtypes  of  lung  adeno¬ 
carcinomas.  Moreover,  a  compartmental  approach  in 
studying  the  field  of  cancerization  (Lig.  1)  will  shed 


Figure  1  Molecular  analysis  of 
the  lung  field  cancerization.  It  is 
unknown  whether  changes  in 
expression  in  the  lung  field  cancer¬ 
ization  are  associated  with  the  devel¬ 
opment  of  a  particular  subtype  of 
non-small  cell  lung  cancer  (NSCLC), 
that  is,  adenocarcinomas  compared 
with  squamous-cell  carcinomas 
(SCO.  Analysing  local  and  distant 
field  cancerization  independently 
for  lung  adenocarcinomas  and  SCC 
may  shed  light  on  events  common 
or  unique  to  the  molecular  patho¬ 
genesis  of  the  two  major  subtypes 
of  NSCLC.  Such  a  'compartmental' 
approach  in  studying  the  field  can¬ 
cerization  may  unravel  biomarkers 
that  can  guide  personalized  preven¬ 
tion  strategies  suitable  for  each 
different  NSCLC  subtype. 


Copy  number  and 
mutations  analyses 


mRNAand 

miRNAprofiling 


Immu  n  ohistochemistry 
andproteomic 
analyses 
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light  on  events  in  the  early  pathogenesis  of  lung 
adenocarcinomas  versus  SCC  and  unravel  biomark¬ 
ers  that  can  be  lineage  specific  and  can  guide  person¬ 
alized  chemoprevention  strategies  suitable  for  each 
different  NSCLC  subtype,  which  may  reduce  the  rela¬ 
tively  high  frequency  of  relapse  of  early  stage  patients. 

PERSPECTIVE 

Despite  numerous  efforts  that  have  focused  on 
increasing  our  understanding  of  the  biology  of  lung 
adenocarcinomas,  this  subtype  of  NSCLC  that  is 
increasing  in  incidence  compared  with  SCC,  consti¬ 
tutes  for  approximately  half  of  lung  cancer  deaths 
each  year,  which  in  turn  comprise  the  biggest  share  of 
cancer- related  deaths  in  the  United  States  and  world¬ 
wide.  Compared  with  advances  in  targeted  and  per¬ 
sonalized  therapy  of  lung  adenocarcinomas,  little 
progress  has  been  made  in  the  tailored  prevention 
of  this  fatal  malignancy  leading  to  a  substantially 
decreased  enthusiasm.  This  may  change  with  the 
recent  encouraging  and  significant  findings  of  the 
NLST.  Various  molecular  markers  and  expression 
classifiers  previously  described  in  the  lung  airways 
and  in  less  invasive  sites  of  the  field  cancerization,  for 
example,  nasal,  sputum  and  exhaled  breath  conden¬ 
sates,  can  aid  in  selecting  high-risk  individuals  best 
suited  for  CT  screening  for  example.  A  comprehensive 
analysis  of  early  molecular  events  in  lung  adenocar¬ 
cinoma  pathogenesis  will  undoubtedly  unravel 
biomarkers  that  can,  in  the  future,  aid  prevention 
through  personalized  strategies,  deliver  its  longstand¬ 
ing  promise  to  oppose  this  disease. 
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Lung  cancer,  of  which  non-small  cell  lung  cancer  (NSCLC)  composes 
the  majority,  is  the  leading  cause  of  cancer-related  deaths  in  the 
United  States  and  worldwide.  NSCLCs  are  tumors  with  complex  bi¬ 
ology  that  we  have  recently  started  to  understand  with  the  advent 
of  various  histological,  transcriptomic,  genomic,  and  proteomic 
technologies.  However,  the  histological  and  molecular  pathogene¬ 
sis  of  this  malignancy,  in  particular  of  adenocarcinomas,  is  still 
largely  unknown.  Earlier  studies  have  highlighted  a  field  canceriza¬ 
tion  phenomenon  in  which  histologically  normal-appearing  tissue 
adjacent  to  neoplastic  and  pre-neoplastic  lesions  display  molecular 
abnormalities,  some  of  which  are  in  common  with  those  in  the 
tumors.  This  review  will  summarize  advances  in  understanding  the 
field  cancerization  phenomenon  and  the  potential  relevance  of  this 
knowledge  to  gain  important  and  novel  insights  into  the  molecular 
pathogenesis  of  NSCLC  as  well  as  to  subsequent  development  of 
biomarkers  for  early  detection  of  lung  cancers  and  possibly  person¬ 
alized  prevention. 

Keywords:  lung  cancer;  field  cancerization;  pathogenesis;  airway  epi¬ 
thelium 

Lung  cancer  is  the  leading  cause  of  cancer  deaths  in  the  United 
States  and  worldwide  in  both  developing  and  developed  regions 
(1).  The  high  mortality  of  this  disease  is  in  part  due  to  our 
lacking  knowledge  of  the  molecular  mechanisms  governing  lung 
cancer  pathogenesis  as  well  as  the  late  diagnosis  of  the  majority 
of  lung  cancers  after  regional  or  distant  spread  of  the  malig¬ 
nancy  (2).  Non-small  cell  lung  cancer  (NSCLC)  represents  the 
majority  of  diagnosed  lung  cancers  (2)  and  is  mainly  composed 
of  squamous  cell  carcinomas  (SCCs)  and  lung  adenocarcinomas 
(2, 3).  Several  major  differences  exist  between  adenocarcinomas 
and  SCCs.  For  example,  compared  with  SCCs  that  arise  from 
the  major  bronchi  and  are  mainly  centrally  located,  pulmonary 
adenocarcinomas  arise  from  small  bronchi,  bronchioles,  or  al¬ 
veolar  epithelial  cells  and  are  typically  peripherally  located,  as 
reviewed  elsewhere  (2-5).  Moreover,  whereas  SCC  pathogenesis 
is  strongly  linked  to  smoking,  lung  adenocarcinoma  is  the  more 
common  histological  subtype  in  never-smoker  patients  (6-9).  Al¬ 
though  the  sequence  of  lesions  in  the  pathogenesis  leading  to 
SCCs  is  well  described,  little  is  known  about  the  sequential  de¬ 
velopment  of  adenocarcinomas.  Moreover,  we  are  still  lacking  in 
our  knowledge  of  differential  mechanisms  of  molecular  patho¬ 
genesis  among  both  subtypes  of  NSCLC. 
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In  light  of  the  postulated  period  of  time  and  multiple  stages  re¬ 
quired  for  the  development  of  overt  epithelial  tumors,  it  is  plau¬ 
sible  to  assume  that  early  diagnosis  of  lung  cancer  or  intraepithelial 
lesions  coupled  with  effective  prevention  strategies  will  reduce  the 
significant  health  burden  associated  with  this  disease  (10).  Despite 
recent  encouraging  findings  from  the  National  Lung  Screening 
Trial  (NLST)  (11),  early  detection  and  prevention  of  lung  cancer 
is  challenging  due  to  the  lack  of  biomarkers  for  early  diagnosis  of 
the  disease  and  to  the  presence  of  multiple  neoplastic  molecular 
pathways  that  mediate  lung  carcinogenesis.  Earlier  studies  have 
highlighted  a  field  cancerization  phenomenon  in  which  histolog¬ 
ically  normal-appearing  tissue  adjacent  to  neoplastic  and  pre¬ 
neoplastic  lesions  displays  molecular  abnormalities,  some  of  which 
are  in  common  with  those  in  the  tumors.  It  is  plausible  to  assume 
that  understanding  early  events  in  tumor  development  that  com¬ 
mence  in  histologically  normal  epithelium  would  pave  the  way  for 
unmet  effective  and  personalized  strategies  for  lung  cancer  pre¬ 
vention  and  treatment.  This  review  mainly  summarizes  advances 
in  understanding  the  field  cancerization  phenomenon  and  the  po¬ 
tential  relevance  of  this  knowledge  to  gain  important  and  novel 
insights  into  the  molecular  pathogenesis  of  NSCLC  as  well  as  to 
subsequent  development  of  biomarkers  for  early  detection  of  lung 
cancers  and  possibly  personalized  prevention. 

NSCLC  PRE-NEOPLASIA  AND 
MOLECULAR  PATHOGENESIS 

From  biological  and  histopathological  perspectives,  NSCLC  is 
a  complex  malignancy  that  develops  through  multiple  pre-neoplastic 
pathways.  Lung  adenocarcinoma,  a  major  subtype  of  NSCLC,  has 
been  increasing  in  incidence  globally  in  both  smokers  and  non- 
smokers  (9),  with  a  concurrent  decrease  in  SCC  frequency.  It 
has  been  suggested  that  the  increasing  incidence  of  lung  adeno¬ 
carcinomas  compared  with  SCCs  is  in  part  due  to  the  change  in 
the  type  of  cigarettes  used  (lower  nicotine  and  tar)  and  smoking 
habits  and  behavior  (7).  Anatomical  differences  in  the  location  of 
diagnosed  lung  adenocarcinomas  and  SCCs  strongly  suggest  that 
both  NSCLC  subtypes  develop  through  different  histopathologi¬ 
cal  and  molecular  pathways  and  have  different  cells  of  origin; 
however,  the  specific  respiratory  epithelial  cell  type  from  which 
each  lung  cancer  type  develops  has  not  been  established  with 
certainty  (3).  However,  it  is  noteworthy  that  Clara  cells  and  the 
type  II  pneumocytes  are  believed  to  be  the  progenitor  cells  of  the 
peripheral  airways,  and  peripherally  arising  adenocarcinomas  of¬ 
ten  express  markers  of  these  cell  types  (12,  13). 

The  multistage  stepwise  fashion  of  tumor  development  has  been 
demonstrated  in  various  anatomical  organs  exemplified  by  the  op¬ 
erational  stages  occurring  during  skin  carcinogenesis  (14).  Carcino¬ 
genesis  of  the  skin  is  initiated  by  a  carcinogen-induced  mutational 
event,  promoted  by  clonal  outgrowth,  which  may  be  dependent  on 
tumor  promoters,  followed  by  progression  of  premalignant  lesions 
(e.g.,  papillomas)  and  their  conversion  to  malignant  tumors  (14). 
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Importantly,  earlier  genetic  studies  indicate  that  nonrandom, 
sequential  chromosomal  aberrations  (trisomies  of  chromosomes  6 
and  7)  are  associated  with  premalignant  progression  of  mouse  skin 
papillomas  (14-17).  The  sequence  of  histopathological  and  molec¬ 
ular  changes  in  bronchial  epithelia  that  precede  the  development 
of  lung  SCCs  have  been  characterized,  demonstrating  that  sequen¬ 
tially  occuii'ing  allele -specific  molecular  changes  commence  in  dis¬ 
persed  foci  signifying  a  multistage  fashion  of  squamous  lung 
cancer  pathogenesis  (4,  18).  At  least  in  this  subtype  of  NSCLC, 
it  has  been  shown  that  specific  genomic  alterations  (mainly  3p21, 
3p22-24,  3p25,  and  9p21)  occur  in  histologically  normal  bronchial 
epithelia  from  resected  specimens  (18).  Moreover,  and  notably, 
these  alterations  persisted  in  hyperplasias,  dysplasias,  carcinomas 
in  situ,  and  tumors  that  exhibited  different  commencing  genomic 
aberrations  (18).  In  addition,  loss  of  heterozygosity  (LOH)  in 
the  3p  region  was  demonstrated  in  normal  bronchial  epithelia  of 
cancer-free  smokers,  further  highlighting  the  early  role  this  specific 
genomic  alteration  exerts  in  lung  cancer  pathogenesis  (19,  20). 
It  is  noteworthy  that  McCaughan  and  colleagues  demon¬ 
strated  that  no  low-grade  lesions,  but  all  high-grade  lesions, 
exhibited  3q  amplification  targeting  the  sex  determining  re¬ 
gion  Y-box  2  {SOX2)  lineage -specific  oncogene  (21).  These 
previous  findings  and  reports,  through  highlighting  associations 
between  histopathological  sequences  and  specific  molecular 
aberrations,  pinpoint  to  a  multistage  and  multistep  manner  of 
lung  carcinogenesis. 

On  the  other  hand,  one  cannot  neglect  the  alternative  hypoth¬ 
esis  that  the  sequence  of  genetic  and  epigenetic  alterations  is 
irrelevant  to  lung  cancer  pathogenesis,  but  rather  the  accumula¬ 
tion  of  molecular  abnormalities  beyond  a  certain  threshold  medi¬ 
ates  development  of  the  malignant  phenotype.  It  has  been 
suggested  that  at  least  two  molecular  pathways,  the  Kirsten  rat 
sarcoma  viral  oncogene  (KRAS)  and  epidermal  growth  factor 
receptor  (EGFR)  pathways,  are  involved  in  the  development 
of  smoker  and  never-smoker  adenocarcinomas,  respectively  (2, 
9).  Moreover,  and  as  reviewed  by  Yatabe  and  colleagues,  atyp¬ 
ical  adenomatous  hyperplasias  (AAHs),  which  are  considered  to 
be  precursor  lesions  for  peripheral  lung  adenocarcinomas  (3,  5) 
and  the  only  sequence  of  morphologic  change  identified  so  far 
for  the  development  of  invasive  lung  adenocarcinomas,  exhibit 
KRAS  mutations  more  frequently  than  invasive  adenocarcino¬ 
mas  (5).  Conversely,  our  group  has  previously  demonstrated  that 
EGFR  mutations  commence  in  histologically  normal  bronchial 
epithelia  adjacent  to  lung  adenocarcinomas  and  precede  copy 
number  increase  of  the  oncogene  (22,  23).  EGFR  mutations  also 
are  persistent  throughout  the  different  phases  of  lung  adenocar¬ 
cinoma  development  (5),  which  harbor  different  genomic  alter¬ 
ations  (24).  It  is  plausible  to  surmise  that  only  after  increasing 
our  knowledge  of  the  pre-neoplastic  changes  as  well  as  the  cor¬ 
responding  molecular  abnormalities  leading  to  the  development 
of  lung  adenocarcinomas  would  we  then  be  able  to  more  confi¬ 
dently  determine  whether  adenocarcinomas  follow  a  linear  pro¬ 
gression  mechanism  or  not  (5).  However,  and  based  on  the 
aforementioned  previously  reported  observations  by  our  group 
and  others,  we  believe  that  it  is  not  counterintuitive  to  speculate 
that  development  of  lung  malignant  phenotype,  including  that  of 
adenocarcinomas,  is  due  to  stepwise,  sequence-specific,  and  mul¬ 
tistage  molecular  pathogenesis  as  well  as  accumulation  and  com¬ 
bination  of  genetic  and  epigenetic  abnormalities. 

FIELD  CANCERIZATION 

Smoking-Damaged  Epithelium  and  the  Field 
Cancerization  Phenomenon 

Although  the  majority  of  lung  cancer  patients  are  current  or  for¬ 
mer  smokers,  a  relatively  small  proportion  of  these  smokers 


(approximately  15%)  develop  primary  lung  tumors.  Patients  with 
early-stage  NSCLC  commonly  exhibit  recurrence  or  second  pri¬ 
mary  tumor  development  after  definitive  treatment  by  resective 
surgery.  There  is  a  large  body  of  evidence  that  heavy  smokers 
and  patients  who  have  survived  lung  cancer  compose  a  high- 
risk  population  that  may  be  targeted  for  early  detection  and  che- 
moprevention  efforts  (4).  Although  the  risk  of  developing  lung 
cancer  decreases  after  smoking  cessation,  the  risk  never  returns 
to  baseline.  Pre-neoplastic  changes  have  been  used  as  surrogate 
endpoints  for  chemopreventive  studies.  However,  it  was  sug¬ 
gested  that  this  “shooting  in  the  dark”  approach  may  explain 
the  reasons  behind  the  general  failures  of  clinical  chemopreven- 
tion  studies  (10).  Therefore,  novel  approaches  to  identify  the  best 
population  to  be  targeted  for  early  detection  and  chemopreven- 
tion  should  be  devised,  and  risk  factors  for  lung  cancer  develop¬ 
ment  or  relapse  need  to  be  better  defined.  For  these  important 
purposes,  a  better  understanding  of  the  biology  and  molecular 
origins  of  lung  cancer  is  warranted. 

Earlier  work  by  Danely  Slaughter  in  patients  with  oral  cancer 
and  oral  premalignant  lesions  has  suggested  that  histologically 
normal-appearing  tissue  adjacent  to  neoplastic  and  pre- neoplastic 
lesions  display  molecular  abnormalities,  some  of  which  are  in 
common  with  those  in  the  tumors  (25).  In  1961,  a  seminal  report 
by  Auerbach  and  colleagues  suggested  that  cigarette  smoke 
induces  extensive  histological  changes  in  the  bronchial  epithelia 
in  the  lungs  of  smokers  and  that  premalignant  lesions  are  wide¬ 
spread  and  multifocal  throughout  the  respiratory  epithelium, 
suggestive  of  a  field  effect  (26).  This  phenomenon,  coined  “field 
of  cancerization,”  was  later  shown  to  be  evident  in  various 
epithelial  cell  malignancies,  including  lung  cancer.  Some  degree 
of  inflammation  and  inflammatory-related  damage  is  almost  in¬ 
variably  present  in  the  central  and  peripheral  airways  of  smok¬ 
ers  and  may  precede  the  development  of  lung  cancer  (4).  Thus, 
the  field  of  cancerization  may  also  be  explained  by  both  the 
direct  effect  of  tobacco  carcinogens  and  the  initiation  of  inflam¬ 
matory  response.  In  this  context,  different  theories  for  the  ori¬ 
gin  of  the  field  cancerization  or  smoking-related  field  of  injury 
have  been  put  forward  and  extensively  reviewed  elsewhere  by 
Steiling  and  co workers  (27). 

Multiple  altered  foci  of  bronchial  epithelium  are  present 
throughout  the  airway  in  patients  with  lung  cancer  and  smokers 
(18,  28,  29).  As  mentioned  before,  detailed  analysis  of  histol¬ 
ogically  normal,  premalignant,  and  malignant  epithelia  from 
patients  with  lung  SCC  indicated  that  multiple,  sequentially  oc¬ 
curring  allele-specific  chromosomal  deletions  of  LOH  commence 
in  clonally  independent  foci  early  in  the  multistage  pathogenesis 
of  SCCs  (18,  28).  Notably,  31%  percent  of  histologically  normal 
epithelium  and  42%  of  mildly  abnormal  (hyperplasia/metaplasia) 
specimens  had  clones  of  cells  with  allelic  loss  at  one  or  more 
regions  examined.  Moreover,  these  molecular  aberrations  were 
also  found  in  carcinomas  in  situ  and  SCCs  and  at  a  more  ad¬ 
vanced  level  (18).  Molecular  changes  involving  LOH  of  chromo¬ 
somal  regions  3p  (e.g.,  fragile  histidine  triad  gtndFHIT),  9p 
(e.g.,  CDKN2A/pl6),  genomic  instability  (increased  microsatel¬ 
lite  repeats),  and  pl6  methylation  have  been  demonstrated  in 
histologically  normal  bronchial  epithelia  in  patients  with  SCC 
and  in  the  sequence  of  pathogenesis  of  the  disease  (3).  Moreover, 
Nelson  and  colleagues  demonstrated  that  KRAS  is  also  mutated 
in  histologically  normal  lung  tissue  adjacent  to  lung  tumors  (30). 
In  addition,  similar  epigenetic  and  gene  methylation  patterns 
between  tumors  and  adjacent  histologically  normal  epithelia 
were  described.  Belinsky  and  colleagues  reported  aberrant  pro¬ 
moter  methylation  of  pl6,  which  was  described  to  be  commonly 
methylated  in  lung  tumors  (31),  in  at  least  one  bronchial  epithe¬ 
lial  site  from  44%  of  lung  cancer  cases  examined  (32).  Moreover, 
pl6  and  death- associated  protein  kinase  (DARK)  promoter 
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methylation  were  frequently  observed  in  bronchial  epithelium 
from  smoker  but  not  from  never-smoker  patients  with  lung  can¬ 
cer  and  persisted  after  smoking  cessation  (32). 

The  aforementioned  molecular  abnormalities  were  detected 
in  histologically  normal  epithelia  adjacent  to  archival  surgically 
resected  tumors  from  patients  with  primary  lung  cancer.  LOH 
and  microsatellite  alterations  in  multiple  foci  were  also  detected 
in  distal  histological  normal  bronchial  epithelia  of  smokers  with¬ 
out  cancer  (19,  20).  Moreover,  and  importantly,  these  molecular 
abnormalities  were  detected  in  bronchial  epithelia  of  cancer- 
free  former  smokers  and  appeared  to  have  persisted  for  many 
years  after  smoking  cessation.  In  addition,  LOH  was  detected  in 
DNA  obtained  from  bronchial  brushings  of  normal  and  abnor¬ 
mal  lungs  from  patients  undergoing  diagnostic  bronchoscopy 
and  was  detected  in  cells  from  the  ipsilateral  and  contralateral 
lung  (33).  Mutations  in  TP53  were  also  described  to  occur  in 
bronchial  epithelia  of  cancer-free  smokers  in  a  widely  dispersed 
manner  (34).  Similar  evidence  also  exists  for  promoter  methyl¬ 
ation  and  epigenetic  changes  in  smoking-damaged  lung  epithe¬ 
lium  of  cancer-free  patients.  Methylation  of  various  genes, 
including  retinoic  acid  receptor  2  |3  (RAR-P2),  H-cadherin,  ad¬ 
enomatous  polyposis  coli  (APC),  pl6,  and  Ras  association 
(RalGDS/AF-6)  domain  family  member  1  (RASSFFIA)  has 
been  described  in  bronchial  epithelial  cells  of  heavy  smokers 

(35) .  Moreover,  methylation  of  pl6,  glutathione  S-transf erase 
pi  1  (GSTPl),  and  DAPK  was  reported  to  be  evident  in  bron¬ 
chial  brushings  of  one-third  of  cancer-free  smokers  examined 

(36) .  In  the  study  by  Belinsky  and  colleagues,  as  mentioned  be¬ 
fore,  methylation  of  pl6  was  detected  in  epithelia  of  cancer-free 
smokers  (32).  A  more  detailed  list  of  aberrant  gene  promoter 
methylation  in  patients  with  lung  cancer  and  cancer-free  smok¬ 
ers  is  well  summarized  and  explained  in  the  review  by  Heller 
and  CO  workers  (37). 


Transcriptomic  Studies  of  Lung  Field  Cancerization 

High-throughput  microarray  profiling  was  shown  to  be  useful  to 
study  the  transcriptome  of  lung  airways.  Hackett  and  colleagues 
studied  the  expression  of  44  antioxidant-related  genes  using 
bronchial  brushings  from  cancer-free  current  smokers  and 
never-smokers  and  found  significant  up-regulation  of  16  of  the 
antioxidant  genes  in  the  airways  of  smokers  compared  with  non- 
smokers  (38).  Later,  Spira  and  colleagues  described  global  alter¬ 
ations  in  gene  expression  between  normal- appearing  bronchial 
epithelium  of  healthy  cancer-free  smokers  and  that  of  non- 
smokers  (39).  Importantly,  irreversible  changes  in  expression 
in  airways  of  former  smokers  after  years  of  smoking  cessation 
were  described  that  were  believed  to  underlay  the  increased  risk 
former  smokers  exhibit  for  developing  lung  cancer  (39,  40). 
Alterations  in  the  expression  of  microRNAs  were  also  demon¬ 
strated  between  large  airways  of  current  and  never-smokers 
(41).  Notably,  an  80-gene  signature  was  derived  from  the  tran¬ 
scriptome  of  large  airway  epithelial  cells  that  can  distinguish 
smokers  without  overt  cancer  from  smokers  with  lung  cancer 
despite  originating  from  normal  bronchial  epithelia  (42).  More 
recently,  Gustafson  and  coworkers  derived  a  phosphoinositide- 
3-kinase  {PI3K)  pathway  activation  signature  by  using  recombi¬ 
nant  adenoviruses  to  express  the  IlOa  subunit  of  PI3K  in  primary 
human  epithelial  cells  (43).  The  PI3K  pathway  activation  sig¬ 
nature  was  elevated  in  cytologically  normal  bronchial  airways  of 
smokers  with  lung  cancer  and,  importantly,  was  decreased  in  the 
airways  of  high-risk  smokers  whose  dysplastic  lesions  regressed 
after  treatment  with  the  PI3K  inhibitor  myoinositol  (43).  Micro¬ 
array  and  gene  expression  profiling  methodologies  were  also 
used  to  demonstrate  the  wide  anatomical  spread  of  the  lung 
field  cancerization.  Common  gene  expression  alterations  were 
identified  in  bronchial,  nasal,  and  buccal  epithelia  of  smokers 
(44),  and  in  a  separate  study,  the  expression  of  1 19  genes  was 
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Figure  7.  Molecular  mapping 
analysis  of  the  field  canceriza¬ 
tion  in  non-small  cell  lung  can¬ 
cer  (NSCLC).  The  relevance  of 
the  lung  field  cancerization  to 
the  development  of  a  partic¬ 
ular  subtype  of  NSCLC  (i.e., 
adenocarcinoma  compared  with 
squamous  cell  carcinoma  [SCC]), 
is  still  unknown,  yet  possible.  An¬ 
alyzing  local  and  distant  field  of 
cancerization  by  analysis  of  the 
transcriptome  of  airway  brush¬ 
ings  from  multiple  sites  indepen¬ 
dently  for  lung  adenocarcinoma 
(yellow  spots)  and  SCC  (red  spots) 
cases  may  shed  light  on  events 
common  or  unique  to  the  mo¬ 
lecular  pathogenesis  of  the  two 
major  subtypes  of  NSCLC.  A 
"compartmental"  approach  cou¬ 
pled  to  a  gradient  or  detailed 
molecular  mapping  method, 
which  spans  the  tumor  up  to 
the  nasal  or  buccal  epithelium, 
to  studying  the  field  of  canceriza¬ 
tion  may  unravel  biomarkers  that 
can  guide  personalized  preven¬ 
tion  strategies  suitable  for  each  different  NSCLC  subtype.  In  addition,  a  comparison  of  the  distant  field  cancerization  in  patients  with  cancer  (left)  to  the 
expression  patterns  of  the  corresponding  anatomical  location  in  disease-free  individuals  (e.g.,  high-risk  heavy  smokers;  right)  would  facilitate  the  development 
of  efficacious  markers  for  the  detection  of  NSCLC.  ADC  =  adenocarcinoma;  miRNA  =  microRNA. 
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demonstrated  to  be  affected  by  smoking  similarly  in  both  bron¬ 
chial  and  nasal  epithelium  (45). 

Field  Cancerization  Compartmentalization 

In  light  of  the  prevalence  of  mutations  in  the  EGFR  oncogene  in 
adenocarcinomas  and  in  particular  those  occurring  in  never- 
smokers,  Tang  and  colleagues  investigated  the  presence  of 
EGFR  mutations  in  normal  bronchial  and  bronchiolar  epithe¬ 
lium  adjacent  to  EGFR  mutant  tumors  (22).  EGFR  mutations 
were  detected  in  histologically  normal  peripheral  epithelia  in 
44%  of  patients  with  lung  adenocarcinoma  with  mutations  but 
none  in  patients  lacking  mutations  in  the  oncogene  (22).  More¬ 
over,  the  same  study  highlighted  more  frequent  EGFR  muta¬ 
tions  in  normal  epithelium  within  the  tumor  (43%)  than  in 
adjacent  sites  (24%),  suggesting  a  localized  field  effect  phenom¬ 
enon  for  this  abnormality  in  the  respiratory  epithelium  of  the 
lung  (22).  In  addition,  a  higher  frequency  of  mutations  in  cells 
obtained  from  small  bronchi  (35%)  compared  with  bronchioles 
(18%)  was  detected  (22).  More  recently,  EGFR  protein  over¬ 
expression,  similar  to  mutation  of  the  gene,  also  exhibited  a  lo¬ 
calized  field  effect,  as  it  was  more  frequent  in  normal  bronchial 
epithelia  sites  within  tumors  than  in  sites  adjacent  to  and  distant 
from  tumors  (23).  Interestingly,  EGFR  copy  number  alteration 
was  not  evident  in  normal  bronchial  epithelia,  which  is  in  ac¬ 
cordance  with  findings  that  EGFR  copy  number  is  a  relatively 
late  event  in  pathogenesis  of  adenocarcinomas  (5,  23).  These 
findings  suggest  that  adenocarcinomas  may  be  associated  with 
a  field  cancerization  dissimilar  from  that  linked  to  SCCs. 

The  low  frequency  of  molecular  abnormalities  detected  in  the 
centrally  located  bronchial  respiratory  epithelium  in  patients 
with  peripheral  lung  adenocarcinomas,  compared  with  specimens 
from  patients  with  SCCs  (28),  suggests  the  presence  of  two  com¬ 
partments  in  the  lung  with  different  degrees  of  smoking-related 
genetic  damage.  Thus,  smokers  who  develop  SCCs  display  more 
smoking-related  genetic  damage  in  the  respiratory  epithelium  of 
the  central  airway,  whereas  patients  who  develop  adenocarci¬ 
nomas  exhibit  molecular  and  histological  damage  mainly  in  the 
peripheral  airways.  Although  some  molecular  changes  (e.g., 
inflammation  and  signaling  pathways  activation)  have  been 
detected  throughout  the  lung  airway  and  include  both  compart¬ 
ments  (central  and  peripheral  airway),  other  aberrations  have 
been  more  frequently  altered  in  either  central  (e.g.,  LOH,  ge¬ 
netic  instability  evidenced  by  microsatellite  repeats)  or  periph¬ 
eral  (e.g.,  EGFR  mutations  as  mentioned  above)  airways.  These 
interesting  observations  indicate  a  possible  compartmentaliza¬ 
tion  of  held  cancerization  and  its  dissimilarity  between  adeno¬ 
carcinomas  and  SCCs,  which  may  well  reflect  the  differential 
mechanisms  of  pathogenesis  of  both  NSCLC  subtypes. 

FUTURE  DIRECTIONS  AND  CONCLUSION 

Applying  the  same  advanced  high-throughput  methodologies 
currently  used  in  studying  established  tumors  for  the  genetic 
analysis  of  lung  NSCLC  pre-neoplasia  and  histologically  normal 
adjacent  regions  is  expected  to  expand  our  understanding  of  the 
biology  of  this  prevalent  disease.  Next-generation  sequencing 
technology,  through  whole-genome,  whole-exome,  and  whole- 
transcriptome  approaches,  holds  great  promise  for  providing  in¬ 
valuable  insights  into  NSCLC  biology,  diagnosis,  prevention,  and 
therapy  (46).  An  important  step  in  this  direction  was  a  recent 
study  in  which  RNA  of  bronchial  airway  epithelial  cell  brush- 
ings  from  healthy  never-smokers  and  smokers  with  and  without 
lung  cancer  was  analyzed  by  RNA  sequencing  (47)  and  provided 
additional  insight  besides  that  provided  when  using  microarray 
technology. 


Earlier  findings  demonstrated  that  centrally  located  lung 
SCCs  and  peripherally  located  lung  adenocarcinomas  elicit 
and  perpetuate  differential  effects  on  the  airway  epithelia  (4). 
Changes  in  expression  in  the  lung  field  of  injury  have  been  shown 
to  be  similar  in  the  large  and  small  airways,  and  it  is  unknown 
whether  they  are  associated  with  the  development  of  the  partic¬ 
ular  subtype  of  NSCLC.  Addressing  this  question  may  be  highly 
important,  because  both  NSCLC  subtypes  display  different  ge¬ 
nomic  features  (2)  and,  therefore,  are  clinically  managed  by  sig¬ 
nificantly  dissimilar  treatment  strategies,  let  alone  differences 
among  various  subtypes  of  lung  adenocarcinomas.  Moreover, 
revisiting  the  held  cancerization  effect  using  a  compartment al 
coupled  with  a  gradient  or  detailed  molecular  mapping  approach 
in  patients  with  cancer  and  disease-free  individuals  (Figure  1) 
will  shed  light  on  events  in  the  early  pathogenesis  of  lung  adeno¬ 
carcinomas  and  SCCs  and  unravel  biomarkers  that  can  guide 
targeted  and  personalized  chemoprevention  strategies  suitable 
for  each  different  NSCLC  subtype  as  well  as  detection  efforts, 
in  particular  using  less  invasive  sites. 

Despite  numerous  efforts  that  have  centered  on  increasing  our 
understanding  of  the  biology  of  lung  cancer,  this  malignancy  still 
composes  the  biggest  share  of  cancer-related  deaths  in  the  United 
States  and  worldwide.  Compared  with  advances  in  targeted  and  per¬ 
sonalized  therapy  of  NSCLC,  little  progress  has  been  made  in  the 
tailored  prevention  of  this  fatal  malignancy.  This  may  change  with 
the  recent  encouraging  and  significant  findings  of  the  National  Lung 
Screening  Trial  (11).  Various  molecular  markers  and  expression 
classifiers  previously  described  in  the  lung  aii'ways  and  in  less- 
invasive  sites  of  field  canceiization  (e.g.,  nasal  epithelium)  can  aid 
in  selecting  high-risk  individuals  best  suited  for  computed  tomog¬ 
raphy  screening,  for  example.  A  comprehensive  analysis  of  early 
molecular  events  in  NSCLC  pathogenesis  will  undoubtedly  unravel 
biomarkers  that  can  guide  future  chemoprevention  strategies. 

Author  disclosures  are  available  with  the  text  of  this  article  at  www.atsjournals.org. 
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G-Protein  Coupled  Receptor  Family  C,  Group  5,  Member 
A  (GPRC5A)  Expression  Is  Decreased  in  the  Adjacent  Field 
and  Normal  Bronchial  Epithelia  of  Patients  with  Chronic 
Obstructive  Pulmonary  Disease  and  Non-Small-Cell 
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Introduction:  Understanding  oncogenes  and  tumor  suppressor 
genes  expression  patterns  is  essential  for  characterizing  lung  can¬ 
cer  pathogenesis.  We  have  previously  demonstrated  that  mGprcSal 
hGPRC5A  is  a  lung-specific  tumor  suppressor  evidenced  by  inflam¬ 
mation-mediated  tumorigenesis  in  -knockout  mice.  The 

implication  of  GPRC5A  in  human  lung  cancer  pathogenesis,  includ¬ 
ing  that  associated  with  inflanmiatory  chi'onic  obstructive  puhnonary 
disease  (COPD),  a  risk  factor  for  the  malignancy,  remains  elusive. 
Methods:  We  sought  to  examine  GPRC5A  immunohistochemi- 
cal  expression  in  histologically  normal  bronchial  epithelia  (NBE) 
from  lung  disease-free  never-  and  ever-smokers  («  =  13  and  w  =  18, 
respectively),  from  COPD  patients  with  {n  =  26)  and  without  cancer 
{n  =  24)  and  in  non-small  cell  lung  cancers  (NSCLCs)  {n  =  474). 
Quantitative  assessment  of  GPRC5A  transcript  expression  in  airways 
{n  =  6),  adjacent  NBEs  {n  =  29)  and  corresponding  tumors  {n  =  6) 
from  6  NSCLC  patients  was  also  performed. 

Results:  GPRC5A  inummohistochemical  expression  was  signifi¬ 
cantly  lower  in  tumors  compared  to  uninvolved  NBE  {p  <  0.0001)  and 
was  positively  associated  with  adenocarcinoma  histology  {p  <  0.001). 
GPRC5A  airway  expression  was  highest  in  lung  disease-free  NBE, 
decreased  and  intermediate  in  NBE  of  cancer-free  COPD  patients 
ip  =  0.004)  and  further  attenuated  and  lowest  in  epithelia  of  COPD 
patients  with  adenocarcinoma  and  SCC  {p  <  0.0001).  Furthermore, 
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GPRC5A  mRNA  was  significantly  decreased  in  NSCLCs  and  corre¬ 
sponding  NBE  compared  to  uninvolved  normal  lung  {p  =  0.03). 

Conclusions:  Our  findings  highlight  decreased  GPRC5A  expression 
in  the  field  cancerization  of  NSCLC,  including  that  associated  with 
lung  inflanunation.  Assessment  of  the  use  of  GPRC5A  expression  as 
a  risk  factor  for  NSCLC  development  in  COPD  patients  is  warranted. 

KEYWORDS:  Field  cancerization.  Chronic  obstructive  pulmonary 
disease.  Non-small-cell  lung  cancer,  g-protein  coupled  receptor 
family  C,  group  5,  member  A,  gene  expression. 

(JThorac  Oncol  2012;7:  1747-1754) 


Lung  cancer,  the  majority  of  which  is  non-small  cell  lung 
cancer  (NSCLC),  is  the  leading  cause  of  deaths  in  the 
United  States  and  worldwide.^  The  high  mortality  associ¬ 
ated  with  lung  cancer  is  in  part  because  of  late  diagnosis  after 
regional  or  distant  spread  of  the  disease.^  Improved  clinical 
management  of  NSCLC  is  tightly  linked  to  the  identification 
of  new  effective  early  biomarkers  that  can  spear  novel  strate¬ 
gies  for  early  detection,  prevention,  and  treatment.^’^ 

The  majority  (85%)  of  diagnosed  NSCLC  cases  are 
attributable  to  cigarette  smoking."^’^  Auerbach  et  al.  ^  earlier 
showed  that  tobacco  carcinogen  exposure  causes  multifocal 
and  clonal  histopathological  changes  in  the  airway  epithelia 
of  smokers,  suggestive  of  a  field  cancerization  in  the  lung.  In 
addition,  cigarette  smoking  perpetuates  inflammation  thi'ough- 
out  the  smoking-exposed  airway  epithelia  of  heavy  smokers,^ 
which  was  suggested,  to  lead  to  onset  of  lung  cancer  develop¬ 
ment.^  Chronic  obstructive  pulmonary  disease  (COPD)  of  the 
lung  is  an  inflammatory  condition  that  is,  like  lung  cancer, 
causally  linked  to  cigarette  smoking^’^®  and  is  a  major  cause 
of  mortality  in  the  United  States.  Moreover,  preinvasive  lung 
cancer  lesions  are  common  (approximately  50%)  in  airways 
of  COPD  patients.  Importantly,  although  phenotypically 
healthy  smokers  comprise  a  significant  population  at  risk  for 
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lung  cancer, COPD  increases  lung  cancer  risk  by  4.5-fold, 
more  than  either  age  or  quantity  of  smoking.  In  addi¬ 
tion,  33%  of  patients  with  mild-to-moderate  COPD  eventually 
die  because  of  lung  cancer  burden.  Although  smoking  is  the 
main  cause  of  both  diseases,  only  10%  to  15%  of  smokers 
develop  COPD  and/or  lung  cancer,  suggesting  that  other  fac¬ 
tors,  including  genetic  variation  and  expression,  may  differ 
across  individuals  in  response  to  cigarette  smoke."^’^’^’^®  It  is 
plausible  to  assume  that  understanding  common  expression 
patterns  between  both  diseases  for  subsequent  identification 
of  biological  markers  that  explain  risk  of  disease  onset  and/or 
progression  will  favorably  promote  the  clinical  management 
of  COPD-associated  lung  cancer. 

Our  group  has  previously  demonstrated  that  loss  of 
G-protein  coupled  receptor,  family  C,  group  5,  member  A 
(mGprc5a/hGPRC5A),  exemplified  by  the  Gprc5 a-knockout 
mouse  model,  leads  to  chionic  inflammation  and  spontaneous 
lung  tumor  development.  However,  the  expression  of  this 
tumor  suppressor  in  human  lung  NSCLCs,  including  those 
associated  with  inflammatory  conditions  such  as  COPD,  is 
unknown.  We  investigated  the  immunohistochemical  expres¬ 
sion  of  GPRC5A  in  a  large  series  of  NSCLC  histological  tis¬ 
sue  specimens  and  in  normal  bronchial  epithelia  (NBE)  from 
lung-disease-free  individuals,  COPD  patients  and  from  COPD 
patients  with  lung  cancer  and  its  transcript  expression  in  inde¬ 
pendent  resected  normal  epithelia  and  corresponding  tumors. 
Our  findings  reveal  marked  reduced  GPRC5A  expression  in 
human  NSCLC  and  in  the  histologically  normal  field  can- 
cerization  associated  with  COPD  and  lung  cancer.  Our  study 
pinpoints  to  a  potential  role  of  this  tumor  suppressor  in  the  pro¬ 
gression  of  COPD-associated  NSCLC,  which  warrants  further 
studies  to  assess  its  use  as  a  risk  marker  for  this  disease. 

MATERIALS  AND  METHODS 

NSCLC  Tissue  mMcroarrays  and  Normal 
Bronchial  Epithelial  Specimens 

Tissue  microarrays  (TMAs)  used  in  this  study  com¬ 
prised  474  surgically  resected  NSCLC  tumor  specimens  (308 
adenocarcinomas,  166  squamous  cell  carcinomas  [SCCs]) 
collected  under  an  Institutional  Review  Board  protocol  and 
archived  as  formalin-fixed  paraffin-embedded  (FFPE)  speci¬ 
mens  in  the  University  of  Texas  Specialized  Program  of 
Research  Excellence  thoracic  tissue  bank  at  the  University 
of  Texas  M.D.  Anderson  Cancer  Center.  Clinicopathological 
features  of  the  NSCLC  patients  examined  are  summarized  in 
Table  1.  TMAs  were  prepared  with  a  manual  tissue  arrayer 
(Advanced  Tissue  Arrayer  ATAIOO,  Chemicon  International, 
Temecula,  CA)  using  1-mm  diameter  cores  in  triplicate  for 
tumors,  as  described  previously. Histological  sections  4  pm 
in  thickness  were  then  prepared  for  subsequent  immunohisto- 
chemistry  analysis.  FFPE  specimens  of  NBE  from  50  patients 
with  COPD  with  forced  expiratory  volume  in  1  second/forced 
vital  capacity  ratio  of  70%  or  less,  a  smoking  history  of  30  or 
more  pack-years,  and  collected  at  Baylor  college  of  Medicine 
(Houston,  TX)  were  included  in  the  study,  including  24  can¬ 
cer-free  cases  and  26  cases  with  NSCLC.  In  addition,  NBE 
specimens  from  never  smokers  (^  =  13)  and  ever  smokers 


TABLE  1.  Clinicopathological  Features  of  NSCLC  Patients  in 
Tissue  Microarray  Sets  Used  in  the  Study 

Covariate 

Levels 

n  (%) 

Sex 

Female 

240  (50.6) 

Male 

234  (49.4) 

Histology 

Adenocarcinoma 

308  (65.0) 

Squamous  cell  carcinoma 

166  (35.0) 

Stage 

I 

302  (63.7) 

II 

86(18.1) 

III  or  IV 

86(18.1) 

Grade* 

Well 

36  (10.4) 

Moderate 

193  (55.9) 

Poor 

116(33.6) 

Smoking  status 

Current 

178  (37.6) 

Former 

229  (48.3) 

Never 

67(14.1) 

Tobacco  history 

No 

67(14.1) 

Yes 

407  (85.9) 

*Information  on  differentiation  grade  was  available  in  a  subset  of  NSCLCs  analyzed. 
NSCLC,  non-small-cell  lung  cancer. 


(^  =  18)  without  lung  cancer  and  with  no  or  mild  airway 
obstruction  status  were  analyzed. 

Immunohistochemical  Analysis 

Polyclonal  antirabbit  antibodies  raised  against  human 
GPRC5A  were  generated  by  peptide  synthesis  and  site- 
directed  carrier  conjugation  using  keyhole  limpet  hemocyanin 
as  a  custom  service  by  Zymed  Laboratories  Inc.  (South 
San  Francisco,  CA).  The  synthetic  peptide  (cysteine)- 
PSPYKDYEVKKEGS-COOH,  corresponding  to  amino 
acids  344-357  in  the  human  GPRC5A  C-terminus,  was 
covalently  linked  to  keyhole  limpet  hemocyanin  via  an  added 
cysteine  residue,  and  polyclonal  antipeptide  antibodies  were 
prepared  in  rabbits.  Sera  were  confirmed  to  contain  high-titer 
antibodies  against  the  specific  peptide  using  enzyme-linked 
immunosorbent  assays.  TMAs  and  the  histologic  sections  (4 
pm  thick)  of  surgical  resected  specimens  were  deparaffinized 
and  hydrated,  and  antigen  retrieval  was  performed  using  a 
decloaker  with  Dako  target  retrieval  system  at  a  pH  of  6.0  (Dako 
North  America,  Inc.,  Carpinteria,  CA).  Intrinsic  peroxidase 
activity  was  blocked  by  3%  methanol  and  hydrogen  peroxide 
for  12  minutes  and  serum-free  protein  block  (Dako)  was  used 
for  30  minutes  for  blocking  nonspecific  antibody  binding. 
Slides  were  then  incubated  with  the  antibodies  against  human 
GPRC5A  (1 :200  dilution)  at  4°C  overnight.  After  three  washes 
in  Tris-buffered  saline,  slides  were  incubated  for  30  minutes 
with  Dako  Envision  +  Dual  Link  at  room  temperature.  After 
three  additional  washes,  slides  were  incubated  with  Dako 
chromogen  substrate  for  5  minutes  and  were  counterstained 
with  hemotoxylin  for  another  5  minutes.  FFPE  whole-section 
specimens,  except  for  the  omission  of  the  primary  antibodies, 
were  used  as  negative  controls.  The  intensity  and  extent  of 
cytoplasmic  and  nuclear  GPRC5A  immunostaining  were 
evaluated  using  a  light  microscope  (magnification,  x  20) 
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independently  by  two  pathologists  (JF  and  IIW).  GPRC5A 
immunoreactivity  was  mainly  cytoplasmic,  which  was 
quantified  using  a  four- value  intensity  score  (0,  none;  1,  weak; 
2,  moderate;  and  3,  strong)  and  the  percentage  (0-100%)  of 
the  extent  of  reactivity  A  final  cytoplasmic  expression  score 
was  obtained  by  multiplying  the  intensity  and  reactivity 
extension  values  (range,  0-300). 

GPRC5A  Transcript  Expression  in  Resected 
NSCLC  Specimens  and  Adjacent  Airway 
Epithelia 

GPRC5A  mRNA  expression  was  assessed  in  ever-smoker 
NSCLCs  and  their  uninvolved  normal  lung  parenchyma  tis¬ 
sues  as  well  as  in  NBE  collected  by  brushing  of  multiple 
corresponding  airways  from  the  same  lobectomy-  and  pneu¬ 
monectomy-resected  specimens.  Tumor  tissues  from  six  ever- 
smoker  NSCLCs  and  normal  lung  specimens  {n  =  6)  as  well  as 
NBE  by  brushings  {n  =  29)  from  the  same  patients  were  obtained 
under  an  Institutional  Review  Board  approved  protocol  in  which 
specimens  were  collected  from  patients  who  had  signed  letters 
of  consent.  NBE  were  collected  from  multiple  adjacent  to  tumor 
airways  by  brushing  each  site  independently  using  Cytosoft 
cytology  brushes  (Cardinal  Health,  Dublin,  OH).  Parallel 
brushes  were  used  for  touch-prep  for  cytological  assessment  by 
pan-cytokeratin  staining,  which  revealed  epithelial  content  more 
than  90%.  Normal  histology  was  determined  by  hematoxylin 
and  eosin  staining.  Brushes  were  immediately  placed  in  Qiazol 
lysis  reagent  (Qiagen,  Valencia,  CA)  and  stored  in  -80°C  until 
Wther  processing.  NSCLC  and  normal  lung  specimens  were 
shaved  for  histological  assessment  of  percentage  of  tumor  con¬ 
tent  and  malignant  cells  and  for  corresponding  RNA  isolation. 
Total  RNA  from  all  samples  was  puiified  using  the  miRNeasy 
kit  (Qiagen)  according  to  the  manufacturer’s  instructions.  RNA 
quantity  was  determined  using  the  NanoDrop  spectrophotome¬ 
ter,  (Theimo  Scientific,  Wilmington,  DE),  and  quality  was  deter¬ 
mined  by  analysis  of  RNA  integrity  with  Agilent  Bioanalyzer 
2000  (Agilent  Technologies,  Santa  Clara,  CA). 

Quantitative-Real  Time  Polymerase 
Chain  Reaction 

Total  RNA  (150ng)  was  reverse-transcribed  using 
the  high-capacity  RNA-to-cDNA  kit  (Life  Technologies, 
Carlsbad,  CA)  according  to  the  manufacturer’s  instructions. 
Quantitative  real-time  polymerase  chain  reaction  (PCR)  was 
performed  using  TaqMan®  (Applied  Biosystems,  Foster  City, 
CA)  gene  expression  assays  for  GPRC5A  (Hs00173681_ml) 
and  beta-actin  {ACTE)  (Hs99999903_ml)  primers  (Life 
Technologies)  on  a  7900HT  Fast-Real-Time  PCR  System 
(Life  Technologies)  according  to  the  manufacturer’s  instruc¬ 
tions.  All  samples  were  done  in  triplicates  and  normalized  to 
ACTE.  Relative  quantification  was  calculated  using  the  com¬ 
parative  cycle  threshold  method  as  previously  described. 

Statistical  Analysis 

Summary  statistics,  including  frequency  tabulation, 
means,  standard  deviations,  median,  and  range,  were  given 
to  describe  patient  characteristics.  Wilcoxon  rank  sum  test 


or  Kruskal-Wallis  test  was  used  to  compare  GPRC5A 
immunohistochemical  expression  between  two  levels  or  among 
more  than  two  levels,  when  appropriate.  General  linear  model 
was  applied  to  test  the  difference  of  GPRC5A  expression  across 
NBE  from  COPD-  and  cancer-free  never  and  ever  smokers, 
cancer-free  COPD  patients,  as  well  as  NBE  from  patients  with 
both  COPD  and  cancer.  Repeated  measures  analysis  was  used  to 
determine  significance  of  GPRC5A  transcript  variation  across 
matched  NSCLCs,  airways,  and  uninvolved  normal  lung.  All 
statistical  tests  were  two-sided,  and p  values  of  0.05  or  less  were 
considered  to  be  statistically  significant.  Statistical  analysis  was 
performed  with  standard  statistical  software,  including  SAS 
Release  9.1.3  for  Windows  and  S-Plus  8.0  for  Windows. 


RESULTS 

GPRC5A  Immunohistochemical  Expression  in 
Human  NSCLC 

We  have  previously  demonstrated  that  mGprcSa! 
\\GPRC5A  is  a  lung-specific  tumor  suppressor  evidenced 
by  spontaneous  tumor  development  in  mice  with  knockout 
of  this  gene  as  well  as  loss  of  the  transformed  phenotype  in 
malignant  NSCLC  cell  lines  after  GPRC5A  overexpression. 
We  sought  to  examine  the  immunohistochemical  expression 
patterns  of  GPRC5A  tumor  suppressor  in  NSCLC,  which  is 
unknown,  and  its  association  with  clinicopathological  fea¬ 
tures  of  the  disease,  including  the  two  major  histologic  types 
examined,  adenocarcinoma  and  SCC.  The  clinicopatho¬ 
logical  characteristics  of  the  NSCLC  patients  from  which 
FFPE  tumor  specimens  were  analyzed  are  detailed  in  Table 
1.  Representative  photomicrographs  of  GPRC5A  immuno¬ 
histochemical  expression  in  human  lung  adenocarcinoma 
(left)  and  SCC  (right)  are  depicted  in  Figure  \A.  GPRC5A 
immunoreactivity  was  mainly  cytoplasmic.  GPRC5A 
expression  was  significantly  lower  in  the  adenocarcinomas 
{n  =  308;  mean,  48.91  ±36;  median,  41.67;  minimum,  0; 
maximum,  180)  and  SCCs  {n  =  166;  mean,  23.47 ±22.79; 
median,  18.33;  minimum,  0;  maximum,  110)  examined  com¬ 
pared  with  the  uninvolved  normal  bronchial  epithelia  (mean, 
164.52 ±34.04;  median,  160;  minimum,  100;  maximum, 
230)  from  lung-disease-free  smokers  and  never  smokers 
ip  <  0.0001)  (Fig.  \E).  Moreover,  GPRC5A  immunohisto¬ 
chemical  expression  was  significantly  lower  in  SCCs  com¬ 
pared  with  that  in  adenocarcinomas  {p  <  0.0001)  (Fig.  \E). 
We  then  correlated  GPRC5A  with  other  clinicopathological 
features  for  all  NSCLCs.  Notably,  GPRC5A  expression  was 
significantly  increased  in  NSCLC  tumor  specimens  fi'om 
never  smokers  {n  =  67)  (mean,  53.94 ±39.94;  median,  40; 
minimum,  0;  maximum,  163.33)  compared  with  ever  smokers 
(^  =  407)  (mean,  37.71  ±32.65;  median,  30;  minimum,  0;  max¬ 
imum,  180)  {p  =  0.001)  (Fig.  IQ.  GPRC5A  expression  was 
also  significantly  different  among  never,  former  and  current 
smoker  NSCLCs  (p  =  0.003)  (Fig.  ID).  In  addition,  GPRC5A 
expression  was  significantly  positively  associated  with  well- 
differentiated  tumor  grade  in  a  subset  of  NSCLC  tumors 
{n  =  346)  examined  with  available  grade  information  {p  =  0.004) 
(Fig.  IQ.  It  is  important  to  note  that  when  we  examined  each 
histology  type  separately,  there  were  no  statistically  significant 
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FIGURE  1.  GPRC5A  immunohistochemical  expression  in  NSCLC  histological  tissue  specimens*  A,  Representative  photomi¬ 
crographs  of  GPRC5A  immunohistochemical  expression  in  lung  adenocarcinoma  (left)  and  SCC  (right)  observed  under  a  light 
microscope  at  a  magnification  of  20  x.  Bars,  1  00  pm*  8,  Box-plots  depicting  differences  in  GPRC5A  immunohistochemical 
expression  among  NBE,  Adeno,  and  SCCs*  C,  Box-plots  depicting  GPRC5A  expression  based  on  tobacco  history  (never  ver¬ 
sus  ever  smokers),  (D),  smoking  status  (never,  former  and  current),  and  (£),  differentiation  grade  (well,  moderate,  and  poor)* 
p  values  based  on  the  Wilcoxon  rank  sum  test  and  Kruskal-Wallis  test*  GPRC5A,  G-protein  coupled  receptor  family  C,  group 
5,  member  A;  NSCLC,  non-small-cell  lung  cancer;  SCC,  squamous  cell  carcinomas;  NBE,  normal  bronchial  epithelia;  Adeno, 
adenocarcinomas* 


correlations  between  GPRC5  A  expression  and  patient’s  smok¬ 
ing  history  and  status,  and  tumor-differentiation  grade. 

GPRC5A  Immunohistochemical  Expression 
in  NBE  from  COPD  Patients 

We  have  previously  shown  that  loss  of  mGprcSa/ 
hGPRC5A  tumor  suppressor  leads  to  chronic-inflammation- 
mediated  prosurvival  signaling  and  transformation  of 
nonmalignant  lung  epithelial  cells.  The  relevance  of 
GPRC5A  expression  to  human  NSCLC  pathogenesis, 
including  that  associated  with  COPD,  a  chi'onic  inflammatory 
condition  and  risk  factor  for  lung  cancer,  remains  elusive.  In 
light  of  the  strong  association  between  inflammation,  COPD 
and  NSCLC,^®’^^  we  sought  to  examine  the  expression  of 
GPRC5A  NBE  in  COPD-associated  adenocarcinoma  and 
SCC.  GPRC5  A  expression  was  assessed  in  histological  tissue 
specimens  of  NSCLC  and  NBE,  including  those  from  lung 
cancer- free  COPD  and  from  NSCLC  patients.  Representative 
photomicrographs  of  GPRC5A  immunohistochemical 
staining  in  the  different  NBE  are  depicted  in  Figure  2A. 
GPRC5A  immunoreactivity  seemed  to  be  highest  in  NBE 
from  lung-disease-free  never  and  ever  smokers,  successively 
significantly  decreased  in  NBE  of  COPD  patients  who  are 
cancer- free,  and  lowest  in  NBE  of  patients  with  both  COPD 
and  adenocarcinoma  or  SCC  (Fig.  2B).  A  general  linear  model 


demonstrated  a  significant  gradual  decrease  of  GPRC5A 
expression  from  NBE  of  disease-free  never  smokers  to 
NBE  from  patients  with  COPD  and  adenocarcinoma  or 
SCC  (p  <  0.0001)  (Fig.  2B).  The  mean  (177.31  ±30.66)  and 
median  (195;  minimum,  130;  maximum,  220)  of  GPRC5A 
expression  in  NBE  of  lung-disease-free  never  smokers 
were  higher  than  that  of  smokers  (mean,  155.28  ±34.15, 
median,  147.5;  minimum,  100;  maximum,  230)  although 
the  differences  were  not  statistically  significant  (Fig.  2B). 
Moreover,  GPRC5A  expression  was  significantly  higher 
in  NBE  of  lung-disease-free  smokers  relative  to  NBE  in 
cancer-free  COPD  patients  (mean,  85.48 ±53.55,  median, 
72.88;  minimum,  20;  maximum,  211)  (p  =  0.007),  which  in 
turn  was  significantly  higher  compared  with  the  expression 
in  NBE  from  COPD  patients  with  adenocarcinoma  (mean, 
48.15 ±52.84,  median,  35;  minimum,  0;  maximum,  170) 
(p  <  0.0001)  or  SCC  (mean,  32.24 ±34.9,  median,  12.5; 
minimum,  0;  maximum,  101.69)  (p  <  0.0001)  (Fig.  2B). 
Furthermore,  although  GPRC5A  expression  was  lower  in 
NBE  of  COPD  patients  with  SCC  compared  with  those 
with  COPD  and  adenocarcinoma,  the  difference  was  not 
statistically  significant  (Fig.  2B).  These  findings  suggest  that 
reduced  expression  of  the  GPRC5A  tumor  suppressor  may 
be  implicated  in  the  pathogenesis  of  NSCLC  associated  with 
inflammatory  COPD* 
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FIGURE  2.  Reduced  GPRC5A  expression  in  the  sequential  pathogenesis  of  COPD-associated  NSCLC.  A,  Representative 
photomicrographs  of  GPRC5A  immunohistochemical  expression  in  NBE  from  disease-free  never  and  ever  smokers  (upper 
left  and  upper  right,  respectively)  and  in  NBE  from  cancer-free  COPD  patients  (bottom  left)  and  NSCEC  patients  (bot¬ 
tom  right)  with  COPD  observed  under  a  light  microscope  at  a  magnification  of  x  20.  Bars,  1  00  pm.  8,  Box-plots  depicting 
quantification  of  GPRC5A  immunohistochemical  expression  represented  in  (A).  Main  p  value  signifies  statistical  significance 
of  reduced  GPRC5A  expression  in  the  sequential  pathogenesis  of  COPD-associated  NSCEC  and  among  the  five  groups  by 
the  general  linear  model  and  Kruskal-Wallis  test,  p  values  signifying  the  indicated  pair  wise  comparisons  among  the  groups 
were  obtained  by  the  Wilcoxon  rank  sum  test.  GPRC5A,  G-protein  coupled  receptor  family  C,  group  5,  member  A;  NBE, 
normal  bronchial  epithelia;  NSCLC,  non-small-cell  lung  cancer;  COPD,  chronic  obstructive  pulmonary  disease;  SCC,  squa¬ 
mous  cell  carcinomas;  NS,  not  significant. 


We  then  examined  GPRC5A  expression  in  epithelial 
cells  from  different  compartments  of  the  lung  (bronchial, 
bronchiolar  and  alveolar  epithelia  and  cells).  GPRC5A 
expression  was  statistically  significantly  lower  in  bronchi 
and  bronchioles  from  cancer-free  COPD  patients  (p  <  0.005) 
and  patients  with  both  COPD  and  SCCs  or  adenocarcinomas 
(p  <  0.0001)  compared  with  lung-disease-free  never  smokers 
(Supplementary  Table  1,  Supplemental  Digital  Content  1, 
http://links.lww.com/JTO/A334).  In  contrast,  GPRC5A 
expression  in  alveolar  compartment  remained  high  (score 
average  172.2-261.5)  in  all  groups  of  cases  examined,  and 
there  were  no  statistically  significant  differences  based  on 


COPD  status  (Supplementary  Table  1,  Supplemental  Digital 
Content  1,  http://links.lww.com/JTO/A334). 

CPRC5A  Expression  in  Smoker  NSCLC  Field 
Cancerization 

Our  previous  findings  on  GPRC5A  tumor-suppressive 
function^^’^^’^^  as  well  as  our  current  observation  on  significant 
and  progressive  reduced  expression  of  this  tumor  suppressor  in 
NBE  from  lung  cancer-free  COPD  patients  and  patients  with 
NSCLC,  prompted  us  to  probe  whether  GPRC5A  expression 
exhibits  a  field  cancerization  pattern  in  NSCLC.  We  sought 
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to  assess  GPRC5A  transcript  expression  in  multiple  bronchial 
sites  {n  =  29)  as  well  as  in  the  tumors  {n  =  6)  and  uninvolved 
normal  lung  parenchyma  {n  =  6)  in  six  NSCLC  cases  (3  SCCs 
and  3  adenocarcinomas).  Quantitative  real-time  PCR  analysis 
demonstrated  progressively  reduced  GPRC5A  expression  from 
noiTnal  uninvolved  lung  parenchyma  (relative  quantification 
[RQ]  mean,  4.18 ±0.62;  median,  4.16;  minimum,  3.51;  maxi¬ 
mum,  5.04)  to  histologically  normal  and  adjacent  bronchial  epi- 
thelia  (RQ  mean,  3. 12 ±1.94;  median,  2.86,;  minimum,  0.59; 
maximum,  10.12)  to  NSCLC  tumors  (RQ  mean,  1.35  ±0.71; 
median  1.31;  minimum  0.24;  maximum  2.30),  which  exhibited 
on  an  average  the  lowest  expression  of  the  gene  (Fig.  ?>A  and  5) 
ip  =  0.03).  These  findings  suggest  that  GPRC5A  expression  is 
reduced  in  the  smoking-injured  field  cancerization  in  NSCLC 
highlighting  a  potentially  strong  implication  of  this  tumor  sup¬ 
pressor  in  the  pathogenesis  of  this  malignancy. 

DISCUSSION 

In  this  study,  we  assessed  the  expression  of  GPRC5A, 
which  we  had  found  to  exhibit  lung-specific  tumor-suppres¬ 
sor  properties  in  mice,^^’^^’^^  in  human  NSCLC  histological 
tissue  specimens,  as  well  as  in  NBE  from  COPD  and  lung 
adenocarcinoma  and  SCC  patients.  We  found  that  the  protein 
product  of  human  GPRC5A  was  lower  in  lung  tumors  com¬ 
pared  with  uninvolved  histologically  NBE,  and  was  signifi¬ 
cantly  associated  with  adenocarcinoma  histology.  GPRC5A 
airway  expression  seemed  to  exhibit  a  progressive  decrease  in 
the  field  cancerization  of  COPD-related  NSCLC  with  highest 
expression  in  bronchial  epithelia  obtained  from  disease-free 
individuals,  intermediate  immunoreactivity  in  normal  epithe¬ 
lia  from  cancer-free  COPD  patients,  and  lowest  in  normal  epi¬ 
thelia  from  patients  with  both  COPD  and  adenocarcinoma  and 
SCC  histologies.  Furthermore,  GPRC5A  transcript  expression 
was  also  lower  in  histologically  NBE  as  well  as  in  adjacent 
and  corresponding  smoker  NSCLC  tumors,  irrespective  of 


COPD  status,  compared  with  uninvolved  normal  lung  sugges¬ 
tive  of  a  field  cancerization-mediated  expression  modulation 
of  this  G-protein  coupled  receptor.  In  light  of  the  relevance 
of  the  field  cancerization  phenomenon  to  NSCLC  pathogen- 
esis,^“^’^®’^^  our  findings  pinpoint  to  a  potential  tumor- suppres¬ 
sive  role,  similar  to  that  established  in  mice,  of  GPRC5A  in 
the  sequential  development  of  human  NSCLC,  in  particular 
those  associated  with  inflammatory  chionic  obsfiuctive  dis¬ 
ease.  NSCLC  and  COPD  are  both  mainly  attributable  to  ciga¬ 
rette  smoke^’^®’^^  and  are  leading  causes  of  deaths  in  the  United 
States  and  worldwide.  Given  that  COPD  is  a  major  risk 
factor  for  lung  cancer  and  shares  various  pathogenic  features 
with  lung  tumors,^’^®’^^“^^  better  molecular  markers  are  needed 
to  identify  which  COPD  patients  will  continue  to  develop  lung 
malignancies.^^  Thus,  the  results  herein  raise  the  intriguing 
possibility  that  GPRC5A  loss  may  be  a  useful  biomarker  in 
assessing  the  risk  of  NSCLC  development  in  COPD  patients. 

COPD  and  infections  as  well  as  inflammatory  disorders 
of  the  respiratory  tract  may  be  linked  to  an  increased 
risk  of  lung  cancer. We  have  previously  demonstrated 
significantly  increased  macrophage  infiltration  into  lungs  of 
GprcSa-kROckout  mice  along  with  their  direct  association 
with  adenocarcinomas,  which  was  accompanied  by  higher 
constitutive  levels  of  proinflammatory  cytokines  and 
chemokines  and  increased  susceptibility  to  stimulation  of 
nuclear  factor-kappa  B  (NF-kB)  activation  in  vivo.^^  We  also 
showed  that  loss  of  GprcSa-modmiod  activation  of  NF-kB 
was  causally  linked  to  macrophage  recruitment  and  enhanced 
inflammation  toward  the  creation  of  a  tumor-promoting 
microenvironment.^^  It  is  noteworthy  that  lungs  of  patients 
with  severe  COPD  exhibit  macrophage/CD 8+  T  cell  and 
neuti'ophil  infiltration  based  on  histopathological  studies  and 
bronchial-biopsies.^^  Moreover,  it  is  worthwhile  to  mention 
that  we  attempted  to  statistically  assess  the  association 
between  expression  of  GPRC5A  and  that  of  NF-kB  (nuclear 
p65  immunoreactivity)  in  NSCLCs  and  found  a  significant 


FIGURE  3.  Decreased  GPRC5A  mRNA  expression 
in  histologically  NBE  and  corresponding  NSCLC 
tumors  from  resected  specimens  compared  with 
matched  uninvolved  normal  lung.  Total  RNA  was 
isolated  from  brushings  of  NBE  and  frozen  sec¬ 
tions  of  matched  NSCLC  (T)  and  uninvolved  NL 
parenchyma  obtained  from  resected  lobectomy  or 
pneumonectomy  specimens  performed  on  smoker 
early-stage  patients.  A,  GPRC5A  expression  was 
analyzed  in  all  samples  (in  triplicates)  by  QRTPCR 
and  normalized  against  that  of  AGTB  to  yield  a 
relative  expression  by  the  2“^^^  method  depicted 
in  the  graphs.  NSCLC  tumors  (squamous  cell 
carcinoma,  cases  1-3,  and  adenocarcinomas,  cases 
4-6),  uninvolved  NL  parenchyma  and  bronchial 
structures  are  labeled  by  the  indicated  colors.  B, 
GPRG5A  transcript  expression  among  uninvolved 
NL  tissue,  NBE,  and  NSCLC  tumors  was  assessed 
statistically  by  repeated  measures  analysis.  NBE, 
normal  bronchial  epithelia;  NSCLC,  non-small-cell 
lung  cancer;  NL,  normal  lung;  CT,  comparative 
cycle  threshold  method.;  QRTPCR,  quantitative 
real-time  polymerase  chain  reaction. 
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inverse  correlation  between  both  proteins  in  particular 
in  adenocarcinomas  (data  not  shown).  In  addition,  it  was 
recently  shown  that  the  incidence  of  hyperplastic  lesions  in 
lungs  of  Gprc5a-knockout  mice  was  increased  after  exposure 
to  nontypeable  Haemophilus  influenzae  Notably,  bacterial 
colonization,  particularly  with  nontypeable  Haemophilus 
influenzae,  has  been  implicated  as  a  cause  of  airway 
inflammation  in  COPD  besides  cigarette  smoke. In  this 
context,  the  commonalities  between  inflammatory-mediated 
and  histopathological  mechanisms  in  GprcSa-knockovX  mice 
and  those  evident  in  lungs  of  COPD  patients  along  with  our 
current  findings  on  reduced  GPRC5A  expression  in  normal 
cells  from  COPD  patients  pinpoint  to  a  tumor-suppressive  role 
of  this  gene  in  COPD-associated  human  lung  tumorigenesis. 

We  previously  cloned  the  retinoid-regulated  mGprcSa! 
\vGPRC5A  and  found  that  it  was  preferentially  expressed  in  fetal 
and  adult  mouse  and  human  lung  tissue  compared  with  nor¬ 
mal  specimens  from  other  anatomically  distinct  organs. The 
functional  relevance  of  this  lung-specific  expression  was  high¬ 
lighted  by  our  earlier  study  demonstrating  spontaneous  devel¬ 
opment  of  adenomas  and  adenocarcinomas  in  Gprc5a-nockout 
mice,  which  was  not  observed  in  wild-type  littermates.  In  addi¬ 
tion,  spontaneous  tumor  development  in  Gprc5a-knockout 
mice  was  characterized  by  late  tumor  onset  (12-16-month-old 
mice)  and  low  multiplicity.^^  Notably,  we  reported  that  expo¬ 
sure  to  the  tobacco-specific  carcinogen  4-(methylnifiosamino)- 
l-(3-pyridyl)-l-butanone  (nicotine-derived  nitrosamine  ketone) 
augmented  tumorigenesis  evidenced  by  6-month  earlier  devel¬ 
opment  of  lesions,  two-  to  fourfold  increased  tumor  incidence 
and  multiplicity,  as  well  as  a  dramatic  increase  in  lesion  size.^^ 
It  is  worthy  to  note  that,  in  this  study,  we  found  significantly 
reduced  expression  of  GPRC5A  in  human  smoking-injured 
histologically  NBE  adjacent  to  lung  tumors  and  in  the  local¬ 
ized  field  cancerization  of  smoker  NSCLC  patients,  compared 
with  uninvolved  normal  lung.  Moreover,  reduced  GPRC5A 
expression  was  concomitant  among  airways  and  correspond¬ 
ing  NSCLC  compared  with  matched  normal  lung  tissue.  It  is 
plausible  to  assume  that  smoking-mediated  reduced  airway 
expression  of  GPRC5A  in  patients  may  explain  our  previous 
observations  on  augmented  tumorigenesis  in  GprcSa-^ockout 
mice  after  exposure  to  tobacco  carcinogens.  Because  COPD  is 
causally  linked  to  smoking,*^  it  cannot  be  neglected  that 
reduced  GPRC5A  expression  in  NBE  from  COPD  patients 
may  be  because  of  the  molecular  impact  of  smoking  on  the 
airway  transcriptome.  However,  it  is  important  to  mention 
that  we  found  significantly  decreased  GPRC5A  expression  in 
normal  epithelia  from  COPD  patients  compared  with  epithe- 
lia  obtained  from  disease-free  smokers.  Moreover,  GPRC5A 
expression  was  further  decreased  and  lowest  in  normal  airway 
epithelia  from  COPD  patients  with  NSCLC  compared  with 
those  from  cancer-free  COPD  patients.  Thus  in  this  context, 
our  findings  raise  the  possibility  that  GPRC5A  expression  may 
be  impacted  by  cigarette  smoking  and  may  be  both  a  cause  and 
consequence  of  increased  inflammation  in  tumor-promoting 
microenvironment  in  the  lung  as  well  as  NSCLC  development, 
which  may  be  better  delineated  in  future  warranted  studies. 

Our  group  has  previously  shown  that  human  lung  and 
head  and  neck  cancer  cell  lines  exhibit  reduced  GPRC5A 
expression  compared  with  their  normal  counterparts.^^  It  is 


worthwhile  to  mention  that  treatment  of  cancer  cells  with 
all-trans  retinoic  acid  restored  GPRC5A  expression  in  cells 
devoid  of  the  tumor  suppressor  in  part  by  a  retinoic  acid 
receptor-dependent  manner.^^’^^  In  addition  and  in  a  sepa¬ 
rate  study,  we  have  previously  noted  reduced  retinoic  acid 
receptor  beta  (RAR(3)  in  approximately  50%  of  adenocar¬ 
cinomas  and  SCCs  as  well  as  loss  of  RARy  and  retinoid  X 
receptorp  in  a  significant  fr  action  of  the  tumors.^^  Moreover, 
RARp  expression  is  decreased  in  bronchial  epithelia  and 
preneoplastic  lesions  representing  the  sequence  of  lung  SCC 
pathogenesis.^"^  More  recently,  we  demonstrated,  using  gene 
expression  profiling,  that  loss  of  murine  Gprc5a  in  lung  epi¬ 
thelial  cells  reduced  expression  of  markers  of  squamous  dif¬ 
ferentiation  concomitantly  with  an  increase  in  mediators  of 
the  inflammatory  process  (e.g.,  NF-kB  signaling).  In  light 
of  the  biological  connection  between  retinoid  signaling  and 
GPRC5A  expression  as  well  as  our  current  findings  in  this 
revised  article  on  the  significant  reduced  expression  of  the 
tumor  suppressor  in  NBE  of  COPD  patients  including  those 
with  NSCLC,  it  is  plausible  to  suggest  that  retinoid  signal¬ 
ing  may  be  aberrantly  regulated  in  pathogenesis  of  COPD- 
associated  NSCLC.  It  is  important  to  note  that  retinoids  are 
currently  being  tested  for  their  lung  regenerative  properties 
and  effects  in  patients  with  COPD.^^ 

In  conclusion,  we  report  herein  decreased  expression  of 
GPRC5A  in  human  NSCLC  tissue  compared  with  uninvolved 
NBE  and  in  the  field  cancerization  of  smoker  NSCLCs  pin¬ 
pointing  to  a  tumor-suppressor  role  of  this  G-protein  coupled 
receptor  in  the  pathogenesis  of  this  leading  cause  of  cancer- 
related  deaths.  Moreover,  our  findings  highlight  a  progressive 
decrease  in  GPRC5A  expression  in  the  sequential  pathogen¬ 
esis  of  NSCLCs  arising  in  COPD  patients,  warranting  future 
studies,  including  the  analysis  of  lung  cancer  preneoplastic 
lesions,  to  assess  the  potential  of  the  utility  of  this  gene  as  a 
biomarker  for  lung  cancer  risk  in  COPD  patients. 
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Abstract 

Gene  expression  alterations  in  response  to  cigarette  smoke  have  been  characterized  in  normal-appearing 
bronchial  epithelium  of  healthy  smokers,  and  it  has  been  suggested  that  adjacent  histologically  normal 
tissue  displays  tumor-associated  molecular  abnormalities.  We  sought  to  delineate  the  spatial  and  temporal 
molecular  lung  field  of  injury  in  smoker  patients  with  early-stage  non-small  cell  lung  cancer  (NSCLC; 
n  =  19)  who  were  accrued  into  a  surveillance  clinical  trial  for  annual  follow-up  and  bronchoscopies  within 
1  year  after  definitive  surgery.  Bronchial  brushings  and  biopsies  were  obtained  from  six  different  sites  in  the 
lung  at  the  time  of  inclusion  in  the  study  and  at  12,  24,  and  36  months  after  the  first  time  point.  Affymetrix 
Human  Gene  1.0  ST  arrays  were  used  for  whole-transcript  expression  profiling  of  airways  (n  =  391). 
Microarray  analysis  identified  gene  features  [n  =  1,165)  that  were  nonuniform  by  site  and  differentially 
expressed  between  airways  adjacent  to  tumors  relative  to  more  distant  samples  as  well  as  those  (n  =  1,395) 
that  were  significantly  altered  with  time  up  to  3  years.  In  addition,  gene  interaction  networks  mediated 
by  phosphoinositide  3-kinase  (PI3K)  and  extracellular  signal-regulated  kinase  (ERK)l/2  were  modu¬ 
lated  in  adjacent  compared  with  contralateral  airways  and  the  latter  network  with  time.  Furthermore, 
phosphorylated  ART  and  ERKl/2  immunohistochemical  expression  were  significantly  increased  with 
time  (nuclear  pAKT,  P  =  0.03;  cytoplasmic  pAKT,  P  <  0.0001;  pERKl/2,  P  =  0.02)  and  elevated  in 
adjacent  compared  with  more  distant  airways  (nuclear  pAKT,  P  =  0.04;  pERKl/2,  P  =  0.03).  This  study 
highlights  spatial  and  temporal  cancer- associated  expression  alterations  in  the  molecular  field  of  injury 
of  patients  with  early-stage  NSGLGs  after  definitive  surgery  that  warrant  further  validation  in  indepen¬ 
dent  studies.  Cancer  Prev  Res;  6(1);  8-17.  ©2012  AACR. 


Introduction 

Lung  cancer,  of  which  non-small  cell  lung  cancer 
(NSGLG)  comprises  the  majority,  is  the  leading  cause 
of  cancer-related  deaths  in  the  United  States  and  worldwide 
(1,2).  The  high  mortality  of  this  disease  is,  in  part,  due  to  the 
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late  diagnosis  of  the  majority  of  lung  cancers  after  regional 
or  distant  spread  of  the  malignancy  (3).  Recent  data  from 
the  National  Lung  Screening  Trial  (4),  indicating  that 
screening  increases  early  detection  rates,  are  expected  to 
augment  the  number  of  early-stage  NSGLGs  detected  war¬ 
ranting  the  need  for  better  clinical  management  of  this 
growing  subpopulation.  Besides  adjuvant  therapy,  there 
are  no  effective  chemoprevention  strategies  for  patients 
with  early-stage  NSGLGs  (5)  who  comprise  approximately 
50%  of  all  diagnosed  cases  and  have  a  relatively  high  rate  of 
relapse  (3).  Improved  clinical  management  of  early-stage 
NSGLGs  is  tightly  linked  to  the  identification  of  new  effec¬ 
tive  early  biomarkers  that  can  guide  potential  chemopre¬ 
vention  strategies. 

Most  diagnosed  NSGLG  cases  (85%)  are  attributable  to 
cigarette  smoking  (6-8).  Auerbach  and  colleagues  found 
that  cigarette  smoke  induces  extensive  histologic  changes 
in  the  bronchial  epithelia  in  the  lungs  of  smokers  and 
that  premalignant  lesions  are  widespread  and  multifocal 
throughout  the  respiratory  epithelium,  suggestive  of  a  field 
effect  (9).  Many  molecular  abnormalities,  such  as  LOH 
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(10-12),  mutations  in  TP53  (13),  methylation  of  pi  6  tumor 
suppressor,  death-associated  protein  kinase  [DAPK),  and 
retinoic  acid  receptor  2  beta  [RAR-^I),  were  detected  in 
bronchial  epithelia  of  cancer-free  former  smokers  (14-18), 
some  of  which  persist  for  many  years  after  smoking  cessa¬ 
tion  (15).  More  recently,  global  mRNA  and  miRNA  expres¬ 
sion  profiles  have  been  described  in  the  normal-appearing 
bronchial  epithelium  of  healthy  smokers  (19,  20)  that  are 
different  from  those  in  nonsmokers.  Moreover,  expression 
and  pathway  signatures  have  been  derived  from  normal 
bronchial  epithelium  of  smokers  that  exhibited  diagnostic 
properties  (21,  22).  Molecular  changes  involving  LOH  of 
chromosomal  regions  3p  [DDUT  and  FHIT  genes),  9p 
{CDKN2A),  genomic  instability  (increased  microsatellite 
repeats),  and  pi  6  methylation  have  been  shown  in  histo¬ 
logically  normal  epithelium  in  patients  with  squamous  cell 
carcinoma  and  in  the  sequence  of  pathogenesis  of  the 
disease  (14,  23,  24).  Moreover,  Nelson  and  colleagues 
showed  that  KRAS  is  also  mutated  in  histologically  normal 
lung  tissue  adjacent  to  lung  tumors  (25) .  Furthermore,  Tang 
and  colleagues  found  higher  rates  of  EGFR  mutations  in 
adjacent  normal  bronchial  epithelia  (NBE;  refs.  26,  27) 
suggestive  of  a  potential  localized  field  effect. 

It  is  plausible  to  assume  that  understanding  early  molec¬ 
ular  aberrations  in  histologically  normal  smoke-damaged 
airway  epithelium  of  early-stage  patients  would  serve  as  a 
critical  first  step  toward  identification  of  biomarkers  that 
can  guide  lung  cancer  prevention  strategies.  However,  the 
global  molecular  airway  field  of  injury  in  patients  with 
early-stage  NSCLCs,  in  particular  after  definitive  surgery,  is 
unknown.  In  this  study,  we  used  transcript-level  expression 
profiling  coupled  with  gene  interaction  network  analysis 
and  immunohistochemical  (IHC)  analysis  to  characterize, 
in-depth,  site-  and  time-dependent  global  molecular 
alterations  in  airways  of  smoker  patients  with  early-stage 
NSCLCs. 

Materials  and  Methods 

Patient  population  and  airway  epithelial  cell  collection 

Early-stage  (I/II),  current  or  former  smoker  patients  with 
NSCLCs  with  at  least  a  10 -pack-year  smoking  history  and 
without  evidence  of  disease  after  definitive  surgery  were 
recruited  into  the  Vanguard  phase  II  surveillance  clinical 
trial  (clinical  trial  number  NCT00352391)  within  1  year 
from  time  of  surgery.  Patients  were  accrued  between  2004 
and  2008  and  underwent  frequent  testing  including  chest  X- 
rays,  computed  tomographic  (CT)  scans,  laboratory  work, 
serologies,  flexible  bronchoscopies,  and  airway  biopsy  col¬ 
lections  within  1  year  from  surgery  (average,  6  months; 
range,  1-12;  first  time  point),  and  at  months  12,  24,  and 
36  following  the  first  time  point.  Bronchoscopies  were 
conducted  using  white  light  or  both  white  light  and  auto¬ 
fluorescence  modalities.  Biopsies  were  obtained  from  all 
potential  anatomic  locations  and  time  points  per  patient.  In 
total,  there  were  324  evaluable  airway  biopsies.  Histologic 
assessment  was  conducted  to  determine  whether  malignant 
changes  will  occur  during  the  time  period.  Once  patients 
have  completed  3  years  of  testing,  they  were  followed  until 


the  study  is  completed.  Patients  were  composed  of  former 
{n  =  16)  and  current  [n  =  3)  smokers.  One  of  the  3  current 
smoker  patients  quit  smoking  6  months  before  the  24- 
month  time  point.  The  clinicopathologic  variables  of 
patients  in  this  study  are  summarized  in  Table  1.  The  study 
was  approved  by  the  Institutional  Review  Boards,  and  all 
participants  provided  written  informed  consents. 

Bronchial  airway  epithelial  cells  were  obtained  from  5  to 
6  different  sites  (main  carina,  4  airways  from  4  lobes,  and 
the  bronchial  stump  or  main  stem  bronchus  adjacent  to  the 
originally  resected  tumor  and  lobe;  Pig.  1 )  at  each  time  point 
using  an  Olympus  fiberoptic  bronchoscope  (Olympus 
America  Inc.)  and  cytobrushes  (Cellebrity  Endoscopic 
Cytobrush,  Boston  Scientific).  Patients  (n  =  19)  with  sam¬ 
ples/specimens  available  for  analysis  that  were  obtained 
serially  up  to  either  24  or  36  months  and  from  at  least  5 
different  sites  at  each  time  point  [n  =  391  airway  samples) 
were  selected  for  the  study.  Epithelial  cell  content  was 
confirmed  by  cytokeratin  staining  which  yielded  a  90% 
epithelial  cell  content.  Brushes  were  immediately  placed 
in  serum-free  RPMI  medium  on  ice,  vortexed  gently  to 
disperse  epithelia  into  the  media,  and  then  removed.  Sam¬ 
ples  were  then  immediately  centrifuged,  and  cell  pellets 
were  resuspended  in  1  mL  of  PBS.  About  500  pL  of  the 
sample  was  then  again  centrifuged,  and  the  pellet  resus¬ 
pended  in  500  pL  P-mercaptoethanol  containing  RET  buffer 
(Qiagen),  homogenized,  and  stored  in  -SO'^C  until  further 
processing.  Total  RNA  was  isolated  using  the  RNeasy  Mini 
Kit  according  to  the  manufacturer's  instructions  (Qiagen). 

RNA  processing  for  microarrays 

Total  RNA  samples  were  preprocessed  for  subsequent 
hybridization  to  expression  arrays  using  the  WT-Ovation 
and  Encore  Biotin  Module  from  NuGEN  Technologies  Inc. 
(San  Carlos,  CA)  according  to  the  manufacturer's  instruc¬ 
tions.  Briefly,  the  WT-Ovation  Pico  RNA  amplification 
system  was  used  to  generate  amplified  cDNA  using  5  ng  of 
starting  RNA  material.  After  formation  of  double  stranded 
cDNA,  DNA  was  amplified  using  the  SPIA  Amplification 
Method,  a  linear  isothermal  DNA  amplification  process 
developed  by  the  vendor  (NuGEN  Technologies).  The 
WT-Ovation  Exon  Module  (NuGEN  Technologies)  was 
then  used  for  generation  of  amplified  sense  strand  cDNA 
(ST-cDNA)  that  is  suitable  for  subsequent  array  analysis 
with  the  Affymetrix  Human  Gene  1.0  ST  array  platform 
(Affymetrix,  Santa  Clara,  CA).  Eragmented  and  biotin- 
labeled  cDNA  was  then  generated  using  the  Encore  Biotin 
Module  (NuGEN  Technologies)  using  5  pg  of  amplified 
cDNA.  Quality  and  size  distribution  of  unfragmented  SPIA- 
amplified  cDNA  and  subsequent  fragmented  labeled  cDNA 
were  assessed  by  loading  samples  on  an  RNA  6000  Nano 
LabChip  (Agilent)  and  analysis  with  Agilent  Bioanalyzer 
2000  (Agilent).  No  differences  in  quality  were  noted  on  the 
basis  of  the  duration  of  sample  storage. 

Generation  of  microarray  raw  data  and  analysis 

Eragmented  and  labeled  cDNA  (2.5  pg)  were  hybridized 
onto  Human  Gene  1.0  ST  arrays  according  to  the 
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Table  1.  Clinicopathologic  features  of  patients  with  NSCLCs  included  in  the  study 

Months  to 
inclusion 


Patient 

Histology 

Anatomic 

site^ 

Stage 

Recurrence 

Adjuvant 

chemotherapy 

Gender/ 

age,y 

Smoking 

status'^ 

Pack- 

years 

Years  quit 
smoking‘s 

from 

surgery* 

1 

ADC 

LUL 

IIA 

Yes 

No 

Male/81 

Former 

100 

12.0 

1 

2 

ADC 

RLL 

IB 

No 

No 

Male/64 

Former 

27 

19.1 

5 

3 

ADC 

RUL 

IB 

Yes 

Yes 

Male/58 

Former 

12 

25.5 

7 

4 

SCC 

RLL 

IB 

Yes 

Yes 

Female/62 

Former 

60 

7.6 

7 

5 

ADC 

RLL 

lA 

No 

No 

Female/60 

Former 

31 

11.8 

6 

6 

ADC 

LLL 

lA 

Yes 

Yes 

Male/62 

Former 

40 

5.8 

5 

7 

ADC 

RUL 

IB 

Yes 

Yes 

Male/61 

Current 

70 

NA 

10 

8 

SCC 

LUL 

IB 

No 

No 

Male/64 

Current 

100 

NA 

5 

9 

ADC 

RUL 

lA 

No 

No 

Male/69 

Former 

12 

36.5 

8 

10 

ADC 

RUL 

IB 

No 

No 

Female/63 

Former 

18 

8.6 

5 

11 

ADC 

RUL 

IB 

Yes 

No 

Female/53 

Former 

42 

5.8 

3 

12 

SCC 

RUL 

lA 

No 

No 

Male/62 

Former 

68 

0.4 

5 

13 

SCC 

LUL 

lA 

No 

No 

Male/70 

Former 

96 

1.4 

7 

14 

ADC 

LUL 

lA 

Yes 

Yes 

Female/45 

Former 

12.5 

0.9 

12 

15 

ADC 

RLL 

lA 

No 

No 

Female/57 

Current 

80 

NA 

4 

16 

ADC 

LLL 

lA 

No 

No 

Male/65 

Former 

48 

21.0 

7 

17 

ADC 

LLL 

lA 

No 

No 

Male/64 

Former 

92 

5.1 

7 

18 

ADC 

RML 

IB 

No 

No 

Male/71 

Former 

102 

2.3 

2 

19 

ADC 

LUL 

lA 

No 

No 

Female/66 

Former 

50 

18.5 

5 

Abbreviations:  ADC,  adenocarcinoma;  LLL,  left  lower  lobe;  LUL,  left  upper  lobe;  RLL,  right  lower  lobe;  RML,  right  middle  lobe;  RUL,  right 
upper  lobe;  SCC,  squamous  cell  carcinoma. 

^Location  of  primary  tumor  in  the  lung. 

“^Smoking  status  at  time  of  inclusion  into  the  study  (patient  7  quit  smoking  during  the  course  of  the  study). 

“^Years  from  smoking  cessation  to  time  of  inclusion  into  the  study. 

^Months  elapsed  from  surgery  to  time  of  inclusion  into  the  study/first  bronchoscopy  time  point. 


manufacturer's  instructions  (Affymetrix).  Hybridization 
cocktails  containing  samples,  control  oligonucleotide  and 
eukaryotic  hybridization  controls  in  addition  to  hybridiza¬ 
tion  mixes,  dimethyl  sulfoxide  and  nuclease-free  water  were 
heat  denatured  at  99''C  for  5  minutes,  cooled  to  for  5 
minutes,  and  finally  centrifuged  at  maximum  speed  for  1 
minute.  After  injecting  80  pL  of  the  hybridization  cocktails, 
arrays  were  incubated  for  1 7  hours  in  a  hybridization  oven 
set  to  a  temperature  of  with  60  rpm  rotation.  Arrays 
were  washed,  stained,  and  processed  using  Affymetrix  Gen- 
eChip  Fluidics  Station  450  systems  after  which  they  were 
imaged  using  Affymetrix  GeneChip  Scanner  3000  7G  for 
subsequent  generation  of  raw  data  (*GEL  files).  Raw  data 
were  quantified  using  Robust  Multichip  Array  (RMA)  back¬ 
ground  correction,  quantile  normalization,  and  RMA  probe- 
level  models  (RmaPlm)  and  summarization  methods. 
MIAME  compliant  metadata,  normalized  expression  values, 
and  391  GEL  files  were  submitted  to  the  Gene  Expression 
Omnibus  (GEO;  samples  GSM992943-GSM993345,  series 
GSE40407).  After  data  preprocessing  and  normalization,  a 
log2  transformation  was  applied.  Pathways  and  gene  inter¬ 
action  network  analyses  were  conducted  using  the  commer¬ 
cially  available  software  Ingenuity  Pathways  Analysis  (IPA, 
Redwood  Gity,  GA).  All  details  of  the  microarray  analysis 


including  pairwise  analysis  of  adjacent  and  contralateral 
airways  for  patient  clustering  are  included  in  the  Supple¬ 
mentary  Information  and  in  the  4  Supplementary  Sweave 
Reports  accompanying  the  manuscript. 

IHC  analysis  of  airway  biopsies 

Immunohistochemistry  was  done  on  histologic  sections 
of  4-pm  formalin-fixed,  paraffin-embedded  tissue  samples 
prepared  by  a  tissue  arrayer  as  described  previously  (28). 
IHG  analysis  was  conducted  using  purified  rabbit  polyclon¬ 
al  primary  antibodies  raised  against  phospho-AKT(threo- 
nine308)  (1:200  dilution,  done  G31E5E,  catalog  number 
2965)  and  phospho-ERKl/2(Thr202/Tyr204)  (1:400  dilu¬ 
tion,  cIoneD13.14.4E,  catalog  number  43 70;  Gell  Signaling 
Technology).  Antigen  retrieval  was  conducted  using  the 
Dako  Target  Retrieval  System  at  a  pH  of  6  (Dako).  Intrinsic 
peroxidase  activity  was  blocked  by  3%  methanol  and  hydro¬ 
gen  peroxide  for  15  minutes  and  serum-free  protein  block 
(Dako)  was  used  for  7  minutes  for  blocking  nonspecific 
antibody  binding.  Slides  were  then  incubated  with  the 
antibodies  against  phospho-AKT  and  phospho-ERKl/2  at 
room  temperature  for  90  and  120  minutes,  respectively. 
After  3  washes  in  TBS,  slides  were  incubated  for  30  minutes 
with  Dako  Envision+  Dual  Link  at  room  temperature. 
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Figure  1.  Spatial  and  temporal  molecular  field  of  injury  in  patients  with  early-stage  NSCLCs  after  definitive  surgery.  Schematic  depicting  the  site-dependent 
(top)  and  time-dependent  (bottom)  collection  of  airway  epithelia  brushings  by  bronchoscopy.  Smoker  patients  with  early-stage  NSCLCs  were  enrolled  into  a 
surveillance  clinical  trial  for  annual  follow-up  and  bronchoscopies  within  1  year  after  definitive  surgery.  Bronchial  airway  epithelial  cells  (brushings  and 
biopsies)  were  obtained  from  5  to  6  different  sites  composed  of  the  main  carina  (MC),  4  airways  from  4  lobes  ipsilateral  (NON-ADJ),  or  contralateral  (CONTRA) 
to  the  originally  resected  tumor  and  of  the  bronchial  stump  or  main  stem  bronchus  adjacent  to  the  tumor  and  lobe.  All  site-different  airway  epithelia  were 
collected  at  the  time  of  inclusion  in  the  study  and  at  12,  24,  and  36  months  following  the  starting  time  point  (391  airways  from  1 9  patients  with  NSCLCs). 


Following  3  additional  washes,  slides  were  incubated 
with  Dako  chromogen  substrate  for  5  minutes  and  were 
counterstained  with  hematoxylin  for  another  5  minutes. 
Formalin-fixed,  paraffin-embedded  pellets  from  lung  can¬ 
cer  cell  lines  displaying  positive  phospho-AKT  and  phos- 
pho-ERKl/2  expression  by  Western  blot  analysis  were  used 
as  a  positive  control,  whereas  samples  and  whole-section 
tissue  specimens  processed  similarly,  except  for  the  omis¬ 
sion  of  the  primary  antibodies  used  as  negative  controls. 
The  intensity  and  extent  of  cytoplasmic  and  nuclear  phos- 
pho-AKT  and  phospho-ERKl/2  immunostaining  were  eval¬ 
uated  using  a  light  microscope  (magnification,  x20)  inde¬ 
pendently  by  2  pathologists  (J.  Eujimoto  and  1. 1.  Wistuba) 
who  were  blinded  to  the  identity  of  the  samples.  Cyto¬ 
plasmic  expression  was  quantified  using  a  4-value  intensity 
score  (0,  none;  1,  weak;  2,  moderate;  and  3,  strong)  and  the 
percentage  (0%-100%)  of  the  extent  of  reactivity.  A  final 
cytoplasmic  expression  score  was  obtained  by  multiplying 
the  intensity  and  reactivity  extension  values  (range,  0-300). 
Nuclear  expression  score  was  quantified  using  the  percent¬ 
age  of  extent  of  reactivity  (range,  0-100). 

Summary  statistics,  including  frequency  tabulation, 
means,  SDs,  median,  and  range,  were  given  to  describe 
subject  characteristics  and  IHC  protein  expression.  Repeat¬ 
ed  measures  analysis  was  conducted  to  assess  the  differen¬ 
tial  effect  of  sites  on  phosphorylated  ART  and  extracellular 
signal -regulated  kinase  (ERK)  1/2  expression  variation  with 
time.  Mked-effects  models  were  generated  to  assess  signif¬ 
icance  of  site,  time,  and  the  interaction  of  both  factors  to  the 
expression  of  both  proteins.  All  statistical  tests  were  2-sided 


and  P  <  0.05  was  considered  to  be  statistically  significant. 
Statistical  analysis  was  conducted  with  standard  statistical 
software,  including  SAS  Release  8.1  (SAS  Institute)  and  S- 
Plus  2000  (Mathsoft  Inc.). 

Results 

Detailed  site-  and  time-dependent  airway  sampling  of 
the  field  of  injury  in  early-stage  NSCLC  patients  after 
definitive  surgery 

Expression  patterns  molecularly  exemplifying  the  impact 
of  smoking  on  the  airway  epithelium  of  cancer-free  indivi¬ 
duals  have  been  characterized  (8, 19,  29).  Moreover,  molec¬ 
ular  abnormalities  typically  found  in  lung  tumors  have  been 
detected  in  normal  resected  margins  suggestive  of  a  field 
effect  (15,  25-27,  30,  31).  However,  the  biologic  nature  of 
the  field  of  injury  in  particular  after  complete  removal  of  the 
tumor  in  patients  with  early-stage  NSCLCs,  who  are  increas¬ 
ing  in  number  and  for  whom  there  are  no  chemoprevention 
strategies,  are  yet  unknown.  Smoker  patients  with  early- 
stage  NSCLCs  were  recruited  on  a  prospective  phase  II 
surveillance  clinical  trial  that  included  frequent  CT  scans, 
serologies,  and  annual  bronchoscopies  in  which  airway 
brushings  and  biopsies  were  obtained  within  1  year  fol¬ 
lowing  tumor  resection  and  at  12,  24,  and  36  months 
following  the  first  time  point  (fig.  1).  The  first  time  point 
bronchoscopies  were  all  conducted  within  1  year  of  defin¬ 
itive  surgery  (average,  6  months;  range,  1-12).  Nineteen 
patients  were  selected  for  the  study  (Table  1 )  on  the  basis  of 
airway  sampling  of  at  least  5  different  sites  per  time  point 
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and  continuously  up  to  24  or  36  months  giving  rise  to  391 
airway  samples  for  transcript-level  expressing  profiling.  The 
patients  were  accrued  between  2004  and  2008  and  were 
composed  of  former  {n  =  16)  and  current  (n  =  3)  smokers. 
Brushings  and  biopsies  were  obtained  from  the  main  carina, 
airways  relatively  adjacent  to  the  previously  resected  tumor 
and  from  airways  more  distant  from  the  tumor  in  the 
ipsilateral  and  contralateral  lung  (Fig.  1). 

Following  normalization  of  the  raw  expression  data,  a 
mixed-effects  model  was  applied  to  characterize  the  expres¬ 
sion  pattern  of  each  gene  that  incorporated  fixed-effects 
such  as  the  site  from  the  tumor  and  time  after  surgery  of  the 
collected  airway  samples  (Supplementary  Information). 
Histogram  P  value  distribution  plots  after  fitting  beta-uni¬ 
form  rnkture  (BUM)  models  (32)  for  derived  P  values  on 
the  basis  of  the  fixed-effects  suggested  that  both  site  and 
time  of  the  airway  samples  influenced  gene  expression 
modulation  (Supplementary  Fig.  SI  A  and  SIB). 

Site-dependent  differential  expression  patterns  in 
airway  epithelia  of  early-stage  NSCLC  patients 

We  first  sought  to  determine  whether  gene  expression 
profiles  are  differentially  expressed  in  airways  by  site  from 
the  tumor  including  those  relatively  adjacent  to  the  resected 
tumors  compared  with  more  distant  airways.  On  the  basis 
of  the  generated  BUM  models  and  P  value  distributions, 
genes  differentially  expressed  by  site  were  selected  on  the 
basis  of  a  1%  false  discovery  rate  (FDR).  We  identified  1, 1 65 


gene  features  that  were  statistically  significantly  differen¬ 
tially  expressed  by  site  (Supplementary  Table  SI  and  Sup¬ 
plementary  Sweave  Report  S2).  Two-dimensional  hierar¬ 
chical  clustering  showed  that  the  airway  samples  were 
divided  into  2  main  branches  or  clusters  (Fig.  2A)  on  the 
basis  of  expression  of  the  genes.  Moreover,  the  left  cluster  in 
the  indicated  heatmap's  dendogram  contained  a  statistical¬ 
ly  significantly  higher  number  of  adjacent  airway  samples 
than  the  right  branch  which  contained  a  significantly  higher 
proportion  of  main  carinas  and  contralateral  to  the  tumor 
airways  (P  =  0.0027  of  the  Fisher  exact  test  for  count  data). 
In  addition,  the  2-dimensional  clustering  revealed  8  differ¬ 
ent  gene  expression  patterns  which  are  indicated  by  the 
color  bar  and  code  along  the  left  side  of  the  heatmap  (Fig. 
2A).  Some  of  the  different  gene  clusters  or  classes  were 
associated  with  a  specific  group  of  airway  samples.  Notably, 
a  cluster  of  263  genes  (cluster  1,  Supplementary  Table  SI), 
indicated  by  the  dark  green  color  and  asterisk  on  the 
heatmap,  was  found  to  have  highest  average  expression  in 
adjacent  airways  (Fig.  2A  and  Supplementary  Table  S2).  In 
contrast,  another  cluster  of 3 48  genes  (indicated  by  magenta 
color)  appeared  to  be  highest  in  expression  in  main  carinas 
(Fig.  2A). 

We  then  determined  to  functionally  analyze  the  cluster  of 
genes  (n  =  263)  that  was  found  to  exhibit  the  highest 
average  expression  in  adjacent  airways  between  adjacent 
and  contralateral  airways.  Functional  pathways  analysis 
using  IPA  depicted  several  significantly  modulated 
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Figure  2.  Site-dependent  airway  epithelia  differential  gene  expression  patterns.  A,  heatmap  depicting  2-dimensional  clustering  of  airway  samples  (n  =  391 )  and 
genes  (n  =  1 ,165)  that  were  determined  to  be  differentially  expressed  by  site  in  the  mixed-effects  model  on  the  basis  of  a  1  %  FDR  cutoff.  The  identified  8  gene 
clusters  are  labeled  with  different  colors  with  the  green  cluster  of  genes  (n  =  263,  *A)  exhibiting  highest  average  expression  in  adjacent  airways  and 
the  magenta  cluster  (n  =  348,  *MC)  having  highest  expression  in  main  carinas.  B,  gene  interaction  analysis  by  IPA  depicting  networks  with  significant 
scores  that  indicate  the  likelihood  of  genes  in  a  network  being  found  together  than  due  to  chance.  The  depicted  networks  were  found  to  be  mediated 
by  PI3K  (top)  and  ERK1/2  (bottom)  with  both  kinases  themselves  not  modulated  in  expression.  Gene  expression  variation  based  on  the  statistical  cutoff 
described  above  is  depicted  by  color  in  the  network  (red,  upregulated;  green,  downregulated). 
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pathways  and  molecular  functions  in  particular,  T  cell  (P  = 
1.2  X  10“^),  chemokine  C-C  motif  receptor  5  (CCR5;  P  = 
1.5  X  10“^),  and  phospholipase  C  signaling  (P=  3.6  X  10“^). 
In  addition,  topological  gene  interaction  network  analysis 
highlighted  functionally  modulated  and  upregulated  gene 
networks  mediated  by  phosphoinositide  3-kinase  [PBK) 
and  ERK  in  the  adjacent  airways  (Fig.  2B).  It  is  worthwhile 
to  note  that  we  also  observed  increased  modulation  of 
PPK/mitogen- activated  protein  kinase  (AlAPfC) -mediated 
gene  interaction  network  using  a  different  analytic  method 
where  we  compared  expression  profiles  between  adjacent 
and  contralateral  airways  between  patients  in  a  pairwise 
fashion  (data  not  shown.  Supplementary  Sweave  Report  S3). 
These  findings  suggest  that  airway  site-dependent  dif¬ 
ferential  gene  expression  profiles  in  patients  with  early-stage 
NSCLCs  exhibit  increased  molecular  features  and  gene 
interaction  networks  typically  associated  with  cancers. 

We  then  determined  to  analyze  gene  expression  profiles 
while  excluding  main  carinas  because  their  epithelial 
anatomy  is  suggested  to  be  different  from  that  of  other 
airways  (33)  and  thus  may  confound  site-dependent 
observations.  Following  exclusion  of  main  carinas,  we 
found  a  reduced  number  of  genes  {n  =  136)  that  were 
significantly  modulated  by  site  in  the  field  of  injury 
(Supplementary  Sweave  Report  S4).  Two-dimensional 
hierarchical  clustering  showed  that  airway  samples  were 
divided  into  2  main  branches  or  clusters  (Supplementary 
Fig.  S2)  with  significantly  more  adjacent  airways  in  the 
right  branch  (P  =  0.0002  of  the  Fisher  exact  t  test).  More¬ 
over,  the  differentially  expressed  genes  (Supplementary 
Table  S4)  were  composed  of  2  main  subgroups  with  one 
cluster  (top  cluster)  of  113  genes  exhibiting  highest  aver¬ 
age  expression  in  adjacent  airways.  It  is  important  to  note 
that  when  we  cross-compared  gene  clusters  that  we  had 
found  to  exhibit  the  highest  average  expression  in  adja¬ 


cent  airways  when  including  (263 -gene  cluster)  or  exclud¬ 
ing  (113-gene  cluster)  main  carinas,  we  found  a  highly 
significant  overlap  (P  =  2.46  x  10“^^^)  in  the  number  of 
genes  [n  =  96).  Moreover,  the  site-dependent  genes 
identified  after  exclusion  of  main  carinas  were  also  topo¬ 
logically  organized  following  functional  pathways  ana¬ 
lysis  into  interaction  networks  involving  PI3K  and  ERK. 

Gene  expression  profiles  in  the  lung  airway  epithelia  of 
early-stage  NSCLC  patients  are  modulated  with  time 
following  definitive  surgery 

We  then  determined  to  identify  genes  that  were  differ¬ 
entially  expressed  with  time.  On  the  basis  of  the  generated 
BUM  models  and  P  value  distributions,  time-dependent 
differentially  expressed  genes  were  identified  and  selected 
on  the  basis  of  a  5%  FDR  cutoff  (n  =  1,395;  Supplementary 
Table  S3  and  Supplementary  Sweave  Report  S2).  Hierarchi¬ 
cal  clustering  of  samples  indicated  that  the  dendogram's 
main  branches  were  statistically  significantly  unbalanced 
with  respect  to  time;  the  main  left  branch  comprised  a 
higher  number  of  24-  and  36-month  time  points  than  the 
right  cluster  (P  =  4.15  x  10“^  of  the  Fisher  exact  test  for 
count  data).  Two-dimensional  clustering  of  both  genes  and 
samples  revealed  2  main  classes  of  genes:  those  that  dis¬ 
played  increased  (upper  cluster)  and  decreased  (lower  clus¬ 
ter)  expression  with  time  (Fig.  3A).  Functional  pathways 
and  gene  interaction  network  analysis  of  genes  differentially 
expressed  between  36  months  and  the  first  time  point  re¬ 
vealed  statistically  significantly  modulated  pathways,  in 
particular  protein  ubiquitination  (5.3  x  10“^),  glutathione 
metabolism  (6.0  x  10“^),  mitochondrial  dysfunction 
(1.4  X  10“'^),  and  oxidative  phosphorylation  (2.9  x 
10“^)  as  well  as  eukaryotic  initiation  factor  2  (eIF2)  signal¬ 
ing  (2.6  X  10“^).  In  addition,  network  analysis  highlighted 
functionally  modulated  and  elevated  gene  interaction 


Figure  3.  Temporal  modulation  of 
the  molecular  field  of  injury  after 
definitive  surgery  in  patients  with 
early-stage  NSCLCs.  A,  heatmap 
depicting  2-dimensional  clustering 
of  airway  samples  (n  =  391)  and 
genes  (n  =  1 ,395)  that  were 
determined  to  be  differentially 
expressed  by  time  in  the  mixed- 
effects  model  on  the  basis  of  a  5% 
FDR  cutoff.  B,  gene  interaction 
analysis,  similar  to  that  in  Fig.  2, 
by  IPA  depicting  networks  with 
increased  likelihood  of  genes  being 
found  together  than  due  to  chance 
and  mediated  by  NF-kB,  ERK1I2, 
AKT,  and  CCND1 .  CCND1  itself  was 
upregulated  at  the  expression  level. 
Gene  expression  variation  based 
on  the  statistical  cutoff  is  depicted 
by  color  in  the  network  (red, 
upregulated;  green,  downregulated). 
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networks  with  time  in  particular  those  mediated  by  NF-kB, 
ERK,  ART,  and  cyclin-Bl  {CCNBl;  Fig.  3B). 

We  then  sought  to  determine  the  relationship  of  genes 
that  were  significantly  modulated  by  site  and  time  in  the 
molecular  field  of  injury.  A  smooth  scatter  plot  of  trans¬ 
formed  P  values  indicated  that  site-  and  time-dependent 
expression  modulations  were  largely  independent  (Supple¬ 
mentary  Fig.  S3  and  Supplementary  Sweave  Report  S2).  We 
then  cross-compared  the  site-dependent  {n  =  1,165)  and 
time-dependent  (n  =  1,395)  profiles  that  we  had  noted  in 
the  molecular  field  of  injury.  Using  hypergeometric  tests  for 
overlapping  probability,  we  found  no  significant  overlap 
between  genes  we  had  determined  to  be  differentially 
expressed  by  site  and  time  in  the  molecular  field  of  injury 
[P  =  0.865;  Supplementary  Sweave  Report  S4). 

Increased  expression  of  phosphorylated  AKT  and 
ERKl/2  in  airway  epithelia  by  site  from  the  tumor  and 
with  time  following  surgery 

Our  findings  on  the  modulation  of  P13K-  and  ERKl/2- 
mediated  networks  by  site  and  time  after  surgery  prompted 
us  to  examine  the  IHC  expression  of  surrogate  markers  of 
both  signaling  cascades  in  corresponding  formalin-fixed, 
paraffin-embedded  airway  biopsies.  We  sought  to  examine 
expression  of  phosphorylated  AKT  atThreonine308  because 
phosphorylation  of  this  amino  acid  is  well  known  to  occur 
through  phosphoinositide-dependent  kinase  1  (PDPKl) 


following  PI3K  activation  (34).  We  assessed  by  immuno- 
histochemistry  the  IHC  expression  of  phospho-AKT 
(Thr308)  and  phospho-ERKl/2(Thr202/Tyr204)  in  avail¬ 
able  and  evaluable  histologically  NBE  biopsies  {n  =  324) 
corresponding  to  the  brushings  analyzed  by  expression 
profiling.  Immunoreactivity  of  phospho-AKT  (minimum, 
0;  maximum,  300)  was  variable  as  depicted  in  the  repre¬ 
sentative  photomicrographs  in  Fig.  4A  and  was  detected  in 
both  the  cytoplasm  and  nucleus  of  NBE  (Eig.  4A).  IHC 
analysis  showed  that  cytoplasmic  (P  <  0.0001)  and  nuclear 
(P  =  0.01)  phospho-AKT  statistically  significantly  increased 
with  time  up  to  3  years  in  all  NBE  (Eig.  4B)  with  highest 
expression  at  the  36-month  time  point.  Nuclear  phosphor¬ 
ylated  AKT  was  also  statistically  significantly  increased  in 
adjacent  NBE  compared  with  airways  more  distant  from 
tumors  in  the  mbced-effects  model  (P  =  0.04;  Eig.  4B). 

Immunoreactivity  of  phospho-ERKl/2  was  also  variable 
(minimum,  0;  maximum,  209)  and  mainly  cytoplasmic 
(Eig.  4C).  IHC  analysis  showed  that  phosphorylated  ERKl/ 
2  was  statistically  significantly  elevated  in  adjacent  NBE  (P  = 
0.03)  and  significantly  increased  with  time  up  to  3  years  in 
all  airways  when  averaged  together  (P  =  0.02;  Eig.  4D)  in  the 
model.  Notably,  there  was  a  significant  interaction  (P  = 
0.019)  between  the  site  of  NBE  and  the  time  of  sampling,  as 
phospho-ERKl/2  expression  was  significantly  increased 
with  time  in  adjacent  NBE  but  not  in  contralateral  airways 
and  main  carinas  in  the  model  (Eig.  4D)  with  highest 
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Figure  4.  Site-  and  time-dependent  IHC  expression  of  phosphorylated  AKT  and  ERK1/2  in  the  airway  field  of  injury.  A,  representative  photomicrographs 
(magnification,  x20)  depicting  strong  (top)  and  weak  (bottom)  phospho-AKT(Thr308)  immunostaining.  B,  IHC  scores  of  cytoplasmic  (left)  and  nuclear 
phospho-AKT  (middle)  were  assessed  for  statistically  significant  differences  by  site  and  time  in  a  mixed-effect  model  and  plotted  in  main  carinas  (MC)  and  in 
adjacent  (ADJ),  nonadjacent  (ipsilateral,  NON-ADJ),  and  contralateral  (CONTRA)  airways  with  time.  C,  representative  photomicrographs  (magnification,  x20) 
depicting  strong  (top)  and  weak  (bottom)  phospho-ERK1/2(Thr202/Tyr204)  immunostaining.  D,  IHC  scores  of  phosphorylated  ERK1/2  levels  were 
assessed  for  statistically  significant  differences  by  site  and  time  in  a  mixed-effect  model  and  repeated  measures  analysis,  site  x  time,  term  for  interaction 
between  site  and  time  factors.  Error  bars  represent  SEM. 
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expression  in  adjacent  airways  and  nonadjacent  (ipsilate- 
ral,  green  plot)  airways  observed  at  the  latest  time  point. 
Similar  data  were  obtained  when  we  excluded  main  carinas 
in  the  mixed-effects  model  (data  not  shown).  Moreover, 
we  noted  similar  findings  when  all  airway  samples  were 
analyzed  irrespective  of  the  presence  of  preneoplastic 
lesions  (e.g.,  dysplasias). 

These  data  showed  that  like  differential  gene  expression 
profiles  within  the  lung  airway  field  of  injury,  canonical 
activated  oncogenes  are  modulated  by  site  from  the 
resected  tumor  and  time  following  definitive  surgery  in 
patients  with  early-stage  NSCLCs. 

Discussion 

In  this  report,  we  characterized  differential  expression 
profiles  and  protein  expression  within  the  lung  airway  field 
of  injury  of  patients  with  early-stage  NSCLCs  by  site  from 
the  tumor  and  time  in  years  following  surgery.  We  showed, 
and  to  our  knowledge  for  the  first  time,  that  gene  expression 
profiles  in  histologically  normal  airways  of  patients  with 
early-stage  NSCLCs  are  nonuniform  by  site  and  are  mod¬ 
ulated  with  time  up  to  3  years  following  surgery.  Moreover, 
functional  analysis  of  the  expression  profiles  showed  that 
canonical  expression  patterns  and  protein  kinase  activation, 
typical  of  tumors,  are  increased  in  airway  sites  adjacent  to 
tumors  as  well  as  remain  or  are  further  modulated  in  the 
lung  airway  field  of  injury  for  three  years  after  definitive 
surgery.  In  particular,  phosphorylated  ERK/MAPK  and  AKT 
expression  were  upregulated  in  NBE  with  time  and  by  site 
from  tumors.  In  light  of  the  growing  subpopulation  of  early- 
stage  NSCLCs,  our  findings  are,  in  part,  proof-of-principle 
and  raise  the  intriguing  possibility  of  the  importance  of 
intense  surveillance  and  molecular  characterization  of  the 
remaining  smoking-injured  airway  epithelia  and  its  poten¬ 
tial  integration  in  the  future  into  clinical  practice  and 
management  of  early-stage  disease.  However,  it  is  notewor¬ 
thy  that  our  study's  patient  cohort,  despite  its  uniqueness  in 
which  expression  profiling  was  serially  conducted  on  air¬ 
ways  from  multiple  sites  collected  during  bronchoscopies 
for  36  months  following  surgery,  is  of  limited  size.  More¬ 
over,  the  reported  findings  warrant  the  need  for  validation 
or  confirmation  in  independent  larger  sets. 

There  is  a  large  body  of  evidence  that  patients  who  have 
survived  an  upper  aero  digestive  cancer  comprise  a  high-risk 
population  that  may  be  targeted  for  early  detection  and 
chemoprevention  efforts  (5,  31).  Currently,  there  are  no 
established  adjuvant  treatments  in  the  tertiary  prevention 
setting  of  patients  with  early-stage  NSCLCs.  It  has  been 
suggested  that  failures  in  advances  of  chemoprevention  are, 
in  part,  due  to  the  lack  of  clear  and  specific  molecular  targets 
(5,  35).  Our  extensive  profiling  of  the  molecular  field  of 
injury  in  patients  with  NSCLCs  identified  cancer-associated 
pathways  (PI3K  and  ERK)  aberrantly  regulated  in  NBE  of 
patients  with  NSCLCs  after  tumor  removal.  In  this  context, 
it  is  plausible  to  suggest  that  a  thorough  characterization  of 
the  molecular  field  of  injury  in  patients  with  early-stage 
NSCLCs  can  aid  in  identification  of  aberrantly  expressed 
pathways,  for  example,  PI3K  and  ERK,  which  could  poten¬ 


tially  serve  as  suitable  targets  for  chemoprevention.  How¬ 
ever,  it  cannot  be  neglected  that  the  alternative  hypothesis 
can  counter  argue  that  activation  of  PI3K  and  ERK/MAPK 
pathways  may  be  beneficial  for  chemoprevention  as  mar¬ 
kers  of  both  pathways  (phosphorylated  AKT  and  ERK) 
increased  following  surgical  tumor  resection.  Our  sugges¬ 
tion  that  such  pathways  may  serve  as  chemoprevention 
targets  should  be  interpreted  cautiously  and  is  presented 
owing  to  the  known  promalignant  function  of  the  pathways 
and  gene  networks  highlighted  in  our  analysis  (36). 

The  first  time  point  brushings  in  this  study  were  obtained 
within  1  year  of  patient  accrual  and  not  at  the  time  when  the 
tumor  was  still  in  situ.  The  variable  starting  time  point  from 
surgery  among  patients  is  a  limiting  factor  in  our  analysis  as 
it  is  plausible  to  assume  that  the  molecular  field  of  injury 
may  vary  upon  tumor  removal.  We  were  not  able  to  avoid 
this  caveat  given  the  difficulty  of  accrual  of  patients  follow¬ 
ing  surgical  tumor  resection  to  obtain  bronchial  brushings 
at  6  different  anatomical  sites  in  the  bronchial  tree  annually 
up  to  3  years.  However,  it  is  important  to  mention  that  the 
time-dependent  gene  expression  profiles  we  had  identified, 
despite  not  incorporating  the  molecular  field  effect  at  the 
time  the  tumor  was  present,  showed  gradual  changes  in 
expression  with  time.  This  effect  was  also  evident  when 
analyzing  the  IHC  expression  of  phosphorylated  AKT  and 
ERK  in  corresponding  bronchial  biopsies  with  highest 
expression  at  3  years.  Our  findings  warrant  future  larger 
studies  in  which  the  molecular  field  of  injury  at  the  time  the 
tumor  is  still  in  situ  can  also  be  serially  monitored  for  several 
years. 

The  patient  population  we  had  studied  was  composed 
of  patients  with  early-stage  NSCLCs,  which  is  in  contrast 
to  earlier  transcriptomic  studies  of  the  molecular  field  of 
injury  that  focused  on  phenotypically  normal  smokers 
and  nonsmokers  (19,  21,  22).  It  is  still  not  clear  whether 
the  differences  in  expression  described  in  this  study  reflect 
an  already  present  gradient  field  of  injury  that  may  have 
contributed  to  tumor  development  in  light  of  the  differ¬ 
ential  cancer-associated  pathways  identified  or  one  that 
arises  due  to  the  molecular  impact  of  the  tumor  on  the 
adjacent  field.  It  is  important  to  note  that  in  this  study, 
the  spatial  and  temporal  molecular  field  of  injury  in 
patients  with  lung  cancer  was  profiled  prospectively  start¬ 
ing  within  1  year  following  definitive  surgery.  Thus,  the 
above  speculation  may  be  addressed  by  a  similar  thor¬ 
ough  spatial  and  temporal  characterization  of  the  molec¬ 
ular  field  of  injury  before  and  after  surgery  in  early-stage 
patients.  In  addition,  we  did  not  have  access  to  similar 
type  of  brushings  from  cancer-free  individuals  such  as 
high-risk  smokers.  Similar  analysis  of  the  molecular  field 
of  injury  in  cancer- free  individuals  will  shed  light  on  the 
nature  of  site-  and  time-dependent  expression  patterns  in 
the  field  of  injury  and  whether  such  changes  in  patients 
with  cancer  reflect  recovery  from  surgery  (temporal  pro¬ 
files)  or  are  a  cause  or  consequence  of  tumor  development 
in  the  adjacent  field  (spatial  profiles). 

In  conclusion,  our  unique  study  identified  gene  expres¬ 
sion  profiles,  functional  gene  networks,  and  activated  levels 
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of  oncogenic  protein  kinases  within  the  field  of  injury  of 
patients  with  early-stage  NSCLCs  that  are  modulated  or 
increased  in  airways  spatially  from  the  tumor  and  tempo¬ 
rally  following  surgery.  Moreover,  the  herein  previously 
uncharacterized  airway  cancer-associated  expression  and 
protein  kinase  alterations  harbor  potential  valuable  targets 
for  chemoprevention  and  warrant  confirmation  and  further 
studies  in  larger  independent  cohorts. 
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Earlier  work  identified  specific  tumor-promoting  abnormalities  that  are  shared  between  lung  cancers  and  adja¬ 
cent  normal  bronchial  epithelia.  We  sought  to  characterize  the  yet  unknown  global  molecular  and  adjacent  airway 
field  cancerization  (FC)  in  early-stage  non-small  cell  lung  cancer  (NSCLC). 

Whole-transcriptome  expression  profiling  of  resected  early-stage  (l-IIIA)  NSCLC  specimens  (n  =  20)  with  matched 
tumors,  multiple  cytologically  controlled  normal  airways  with  varying  distances  from  tumors,  and  uninvolved 
normal  lung  tissues  (n  =  194  samples)  was  performed  using  the  Affymetrix  Human  Gene  1.0  ST  platform.  Mixed- 
effects  models  were  used  to  identify  differentially  expressed  genes  among  groups.  Ordinal  regression  analysis 
was  performed  to  characterize  site-dependent  airway  expression  profiles.  All  statistical  tests  were  two-sided, 
except  where  noted. 

We  identified  differentially  expressed  gene  features  (n  =  1661)  between  NSCLCs  and  airways  compared  with 
normal  lung  tissues,  a  subset  of  which  (n  =  299),  after  gene  set  enrichment  analysis,  statistically  significantly 
(P<  .001)  distinguished  large  airways  in  lung  cancer  patients  from  airways  in  cancer-free  smokers.  In  addition, 
we  identified  genes  (n  =  422)  statistically  significantly  and  progressively  differentially  expressed  in  airways  by 
distance  from  tumors  that  were  found  to  be  congruently  modulated  between  NSCLCs  and  normal  lung  tissues. 
Furthermore,  LAPTM4B,  with  statistically  significantly  increased  expression  (P<  .05)  in  airways  with  shorter  dis¬ 
tance  from  tumors,  was  upregulated  in  human  immortalized  cells  compared  with  normal  bronchial  epithelial  cells 
(P<  .001)  and  promoted  anchorage-dependent  and  -independent  lung  cancer  cell  growth. 

The  adjacent  airway  FC  comprises  both  site-independent  profiles  as  well  as  gradient  and  localized  airway  expres¬ 
sion  patterns.  Profiling  of  the  airway  FC  may  provide  new  insights  into  NSCLC  oncogenesis  and  molecular  tools 
for  detection  of  the  disease. 
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Earlier  work  by  Slaughter  et  ah  in  patients  with  oral  premalignant 
and  cancer  lesions  suggested  that  histologically  normal-appear¬ 
ing  tissues  adjacent  to  lesions  display  tumor-associated  molecu¬ 
lar  abnormalities  (1).  Notably  Auerbach  et  ah  demonstrated  that 
cigarette  smoke  induces  widespread  histological  changes  and  pre¬ 
malignant  lesions  in  the  bronchial  epithelia  in  the  lungs  of  smok¬ 
ers,  suggestive  of  a  field  effect  (2).  This  phenomenon,  coined  field 
cancerization  (FC),  was  shown  to  be  evident  in  various  epithelial 
malignancies,  including  gastric,  esophageal,  hepatic,  cervical,  skin, 
and  lung  cancers  (3-6)  and  was  proposed  to  precede  and  explain  the 
development  of  multiple  primary  and  locally  recurrent  cancer  (3,7). 

Previously,  an  analysis  of  histologically  normal  epithelium  and 
premalignant  and  malignant  epithelia  from  lung  squamous  cell  car¬ 
cinoma  (SCC)  patients  indicated  that  multiple,  sequentially  occur¬ 
ring  allele-specific  chromosomal  deletions  commence  early  in  the 


multistage  pathogenesis  of  SCCs  (8,9).  Notably,  31%  of  histologi¬ 
cally  normal  epithelium  specimens  had  clones  of  cells  with  allelic 
loss  at  one  or  more  regions  examined,  including  loss  of  heterozygo¬ 
sity  at  chromosomal  regions  3p  and  9p  (8,10).  Belinsky  et  ah  identi¬ 
fied  promoter  methylation  of  pi  6,  a  common  aberration  in  lung 
tumors  (11),  in  at  least  one  bronchial  epithelial  site  from  44%  of 
lung  cancer  cases  examined  (12).  In  addition,  our  group  and  others 
have  demonstrated  that  mutations  in  the  epidermal  growth  factor 
receptor  (EGFR)  and  KRAS  oncogenes  were  also  found  in  histo¬ 
logically  normal  tissue  adjacent  to  lung  tumors  (13,14). 

Global  expression  profiles  have  been  described  in  bronchial 
epithelium  of  smokers,  a  portion  of  which  exhibited  cancer  diag¬ 
nostic  properties  (15-18),  as  well  as  in  the  field  of  injury  of  early- 
stage  non-small  cell  lung  cancer  (NSCLC)  patients  who  had  their 
tumors  surgically  resected  before  airway  transcriptome  analysis 
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(19).  However,  the  adjacent  airway  FC  in  NSCLC  has  not  yet  been 
characterized  at  a  whole-transcriptome  level.  In  this  study,  we  per¬ 
formed  expression  profiling  of  matched  NSCLCs,  uninvolved  nor¬ 
mal  lung  tissue,  and  multiple  airways  with  varying  distances  from 
tumors  to  define  the  transcriptomic  architecture  of  the  adjacent 
ainvay  FC. 

Methods 

Lung  Tumor  Resected  FC  Specimens  and  Airway 
Epithelial  Cell  Collection 

The  FC  specimens,  comprised  of  lung  tumors,  uninvolved  normal 
lung  parenchyma,  and  multiple  normal-appearing  airways  with 
varying  distances  from  tumors,  were  obtained  from  early  stage  (I- 
IIIA)  patients  at  MD  Anderson  Cancer  Center.  Tumor  stage  was 
classified  as  described  previously  (20).  The  study  was  approved 
by  the  institutional  review  boards,  and  all  participants  provided 
written  informed  consent.  Malignant  and  paired  normal  lung  tis¬ 
sues  from  each  case  patient  were  obtained  snap-frozen,  preserved 
in  Mater  or  by  surface  brushing.  For  each  tissue  sample,  the 
percentage  of  malignant  tissue  was  calculated  by  histological 
examination  (J.  Fujimoto)  after  hematoxylin  and  eosin  staining.  All 
malignant  samples  contained  more  than  40%  tumor  cells.  Twenty 
NSCLC  FC  case  patients  were  included  in  the  study  and,  along 
with  their  clinicopathological  information,  are  summarized  in 
Supplementary  Table  1  (available  online). 

Airway  epithelia  were  obtained  by  brushing  three  to  five  sequen¬ 
tial  bronchiolar  structures  with  varying  distances  from  tumors 
(Supplementary  Figure  1,  available  online)  using  sterile  Cytosoft 
cytology  brushes  (Medical  Packaging  Corporation,  Camarillo,  CA). 
The  spatial  distance  between  two  consecutive  airway  brushings  was 
similar  (approximately  2  cm).  Airways  were  denoted  by  numbers  1 
(relatively  closest  from  tumor)  to  5  (relatively  farthest).  The  rela¬ 
tive  distance  of  an  airway  brushing  (eg,  aii'way  1)  from  the  adjacent 
NSCLC  tumor  was  similar  across  all  case  patients.  Airway  brush¬ 
ings  were  placed  in  Qiazol  lysis  buffer  (Qiagen,  Valencia,  CA)  in  dry 
ice  and  immediately  stored  at  -80°C.  Confirmation  of  epithelial 
cell  collection  by  pan-cytokeratin  immunohistochemical  analysis, 
as  well  as  lack  of  neoplastic  or  preneoplastic  cells  (Supplementary 
Figure  1,  available  online),  was  performed  as  described  in  the 
Supplementary  Methods  (available  online). 

Microarray  Data  Analysis 

RNA  samples  were  processed  for  microarray  expression  profiling 
using  the  Affymetrix  Human  Gene  1.0  ST  platform  (Affymetrix, 
Santa  Clara,  CA)  (Supplementary  Methods,  available  online).  Raw 
data  were  quantified  using  background  correction,  quantile  nor¬ 
malization,  and  robust  multichip  analysis  (21)  probe-level  mod¬ 
els  and  summarization  methods.  Minimum  Information  About  a 
Microarray  Fxperiment  (MIAMF)- compliant  data  were  submitted 
to  the  Gene  Fxpression  Omnibus  under  series  GSF44077  (samples 
GSMl 07 784^GSM  1078069).  Basic  quality  control  was  assessed 
by  graphical  summaries  of  array  intensities  and  Bland-Altman  (M 
vs  A)  plots  (Supplementary  Methods,  Sweave  Report  1,  available 
online).  Linear  mixed-effects  models  were  used  to  characterize 
gene  features  concordantly  differentially  expressed  (in  the  same 
direction)  between  both  NSCLCs  and  airways  compared  with 


normal  lung  tissues  (site-independent  analysis).  The  different 
groups  were  set  as  fixed  effects,  whereas  patients  were  set  as  ran¬ 
dom  factors  in  the  mixed-effects  models.  An  ordinal  logistic  regres¬ 
sion  model  (22-24)  was  applied  to  identify  gene  features  that  were 
statistically  significantly  decreased  or  increased  in  airways  by  dis¬ 
tance  from  tumors  (site-dependent  analysis).  The  model  assumes 
that  each  FC  case  comprises  only  one  airway  brushing  at  a  spe¬ 
cific  spatial  location  (eg,  ainvay  1)  and  fits  expression  patterns  that 
change  in  a  gradient  fashion  with  different  airway  spatial  locations 
(airways  1-5)  classified  as  categorical  ordered  labels.  One-sided  t 
tests  were  used  to  determine  whether  site- dependent  differential 
airway  expression  exhibits  similar  directional  patterns  between 
NSCLCs  and  matched  normal  lung  tissues.  A  site-dependent  FC 
score  was  generated  to  collectively  signify  and  plot  gene  features 
that  are  differentially  modulated  with  respect  to  proximity  from 
tumors  across  all  airways  by  distance  from  tumors  (airways  1-5) 
and  between  paired  NSCLCs  and  uninvolved  normal  lung  tissues. 
The  score  was  calculated  by  the  sum  of  gene  features  upregulated 
in  airways  with  shorter  distance  from  tumors  minus  the  sum  of 
gene  features  downregulated  in  airways  with  shorter  distance  from 
tumors.  To  adjust  for  multiple  testing  in  all  analyses,  beta-uniform 
mixture  models  were  used  to  estimate  false  discovery  rates  as  previ¬ 
ously  described  (25).  Pathways  analysis  was  performed  using  inge¬ 
nuity  pathways  analysis. 

Statistical  Analysis 

Fisher  exact  test  was  used  to  determine  statistical  significance  of 
the  difference  in  airway  type  (cancer  vs  no  cancer)  between  clusters 
after  hierarchical  clustering  analysis.  Kruskal- Wallis  test  was  used 
to  test  for  statistical  significance  of  differences  in  quantitative  real¬ 
time  polymerase  chain  reaction  (QRTPCR)-based  expression  of 
genes  among  NSCLCs,  airways,  and  normal  lung  tissues.  Analysis 
of  variance  was  used  to  test  for  statistical  significance  of  differences 
in  QRTPCR-based  expression  of  genes  among  airways  with  respect 
to  proximity  from  tumors.  Student  t  test  was  used  to  assess  for  statis¬ 
tical  significance  among  different  groups  in  the  in  vitro  experiments. 
All  statistical  tests  were  two-sided,  except  where  noted.  Additional 
methods  including  all  details  and  R  codes  of  the  microarray  statisti¬ 
cal  analysis  are  included  in  the  Supplementary  Methods  and  in  the 
Supplementary  Sweave  Reports  (available  online). 

Results 

Identification  of  Adjacent  Airway  FC  Profiles  in 
Early-Stage  NSCLC 

The  FC  in  the  airway  adjacent  to  NSCLC  has  not  been  charac¬ 
terized  at  the  global  transcriptomic  level  before.  We  analyzed  the 
transcriptomes  of  cytologically  controlled  NSCLC  tumors,  paired 
uninvolved  normal  lung  tissue,  and  brushings  of  normal  airway  epi¬ 
thelia  collected  at  sequentially  varying  distances  from  the  tumors 
(n  =194  samples)  (Supplementary  Figure  1).  Samples  were  obtained 
from  specimens  surgically  resected  from  20  patients  (n  =  9  women 
and  1 1  men;  n  =  5  never-smokers  and  1 5  smokers)  with  stages  I  to 
III  NSCLC  (n  =14  adenocarcinomas,  5  SCCs,  and  1  not  othenvise 
specified  NOS  NSCLC)  (Supplementary  Table  1,  available  online). 

A  schematic  of  the  study’s  design  and  different  analyses  is  rep¬ 
resented  in  Figure  1.  We  first  sought  to  characterize  global  FC 
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Figure  1.  Schematic  of  the  transcriptomic  analysis  of  the  airway  field 
cancerization  (FC)  in  non-small  cell  lung  cancer  (NSCLC).  FC  samples 
comprised  of  matched  lung  tumors,  multiple  cytologically  controlled 
normal  airways  with  varying  proximity  from  tumors,  and  uninvolved 
normal  lung  tissues  were  obtained  from  20  resected  early-stage  (1- 
IIIA)  NSCLC  specimens.  Gene  expression  profiling  of  all  the  samples 
(n  =  194)  was  performed  to  characterize  global  FC  profiles  (differentially 
expressed  in  the  same  direction  between  both  tumors  and  airways 
compared  with  normal  lung  tissues)  in  the  normal-appearing  airway 

profiles  in  the  normal- appearing  airway  adjacent  to  the  NSCLC. 
We  identified  1661  gene  features  (n  =  457  upregulated  and  1204 
downregulated)  (Supplementary  Table  2,  available  online)  that  were 
statistically  significantly  and  concordantly  differentially  expressed 
between  both  NSCLCs  and  airways  compared  with  normal  lung 
tissue  (false  discovery  rate-corrected  P  <  .01;  fold-change  >  1.5) 
(Figure  2 A;  Supplementary  Methods,  Sweave  Report  2,  available 
online).  Further  analysis  identified  statistically  significantly  differ¬ 
entially  expressed  gene  features  between  the  airway  of  adenocarci¬ 
nomas  and  SCCs  (n  =  415)  and  between  the  aii'way  of  never-smoker 
and  smoker  NSCLCs  (n  =  51)  (Supplementary  Methods,  Sweave 
Report  2,  available  online).  Pathway  analysis  of  the  1661  gene  fea¬ 
tures  by  ingenuity  pathways  analysis  revealed  modulation  of  key 
cancer-associated  pathways  and  gene-interaction  networks  (all 
P  <  .05)  (Figure  2B).  The  most  statistically  significantly  (P  <  .001) 
modulated  pathway  and  the  gene  network  with  highest  number 
of  differentially  expressed  and  closely  related  (G-protein  coupled 
receptors)  interacting  genes  are  represented  in  Figure  2,  C  and 
D,  respectively.  These  findings  highlight  expression  patterns  and 
pathways  that  are  typically  deregulated  in  overt  tumors  but  are  also 
prevalent  in  histologically  normal  airway  epithelia. 

Analysis  of  Adjacent  Airway  FC  Profiles  in  Smoker 
Patients  With  Suspected  Lung  Cancer 

We  then  sought  to  ascertain  whether  the  adjacent  FC  profiles  may 
be  indicative  of  lung  cancer  among  smokers.  We  examined  the 
expression  of  the  adjacent  airway  FC  profile  in  a  cohort  from  Spira 
et  al.  (17)  comprised  of  129  large  airway  samples  from  smokers  with 


adjacent  to  NSCLCs  (site-independent  analysis).  Adjacent  FC  profiles 
were  then  analyzed  by  gene  set  enrichment  analysis  (GSEA)  in  a  set  of 
large  airways  of  smokers  (n  =  129)  with  and  without  lung  cancer  (17)  to 
identify  FC  profiles  that  can  distinguish  lung  cancer  among  smokers. 
Global  expression  profiling  was  also  used  to  delineate  FC  profiles  that 
are  modulated  in  the  airway  with  respect  to  proximity  from  correspond¬ 
ing  NSCLCs  (site-dependent  analysis).  Quantitative  real-time  polymer¬ 
ase  chain  reaction  (PCR)  was  used  to  confirm  the  differential  expression 
of  select  airway  FC  markers. 

and  without  lung  cancer.  After  matching  gene  features  between  the 
two  studies  by  common  Fntrez  Gene  identifiers,  261  upregulated 
and  749  downregulated  adjacent  airway  FC  gene  features  remained, 
which  were  used  to  perform  gene  set  enrichment  analysis  (GSEA) 
with  identifiers  ranked  according  to  the  Student  t  statistic  between 
smokers  with  and  without  lung  cancer.  This  analysis  demonstrated 
that  the  gene  features  that  were  downregulated  in  the  adjacent  air¬ 
way  FC  were  statistically  significantly  (P  <  .001)  enriched  within 
gene  features  downregulated  in  large  airways  of  smokers  with  lung 
cancer  compared  with  airways  of  cancer-free  smokers,  although 
such  a  statistically  significant  connection  was  not  observed  for  the 
upregulated  genes.  Leading  edge  gene  sets  comprised  of  299  genes 
(Supplementary  Table  3,  available  online)  that  were  concordantly 
modulated  between  NSCLCs  and  airways  compared  with  normal 
lung  tissue  in  the  adjacent  airway  FC  and  between  large  airways  of 
smokers  with  and  without  lung  cancer  (P  <  .001)  (Figure  3)  were 
then  derived  [as  described  previously  (26)].  These  data  suggest  that 
the  adjacent  airway  FC  harbors  potential  markers  for  detection  of 
lung  cancer  among  smokers. 

Site-Dependent  Analysis  of  the  Adjacent  Airway  FC  by 
Distance  From  Tumors 

We  then  sought  to  determine  whether  the  adjacent  airway  FC  tran- 
scriptome  varies  with  respect  to  tumor  proximity.  We  performed 
ordinal  logistic  regression  analysis  of  the  transcriptomes  of  airway 
samples  (n  =  81)  obtained  at  varying  distances  from  the  tumors. 
Using  a  5%  false  discovery  rate,  we  identified  422  gene  features 
(n  =  335  upregulated  and  87  downregulated  with  shorter  distance 
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Figure  2.  Identification  of  adjacent  airway  field  cancerization  (FC)  pro¬ 
files  in  non-small  cell  lung  cancer  (NSCLC).  A)  Hierarchical  clustering  of 
gene  features  statistically  significantly  differentially  expressed  between 
both  NSCLCs  and  airways  compared  with  normal  lung  tissues  (n  =  1 661 ). 
Columns  represent  samples  (n  =  194  samples  from  20  case  patients),  and 
rows  represent  gene  features  (red  =  upregulated;  blue  =  downregulated). 
B)  Functional  pathways  analysis  using  ingenuity  pathways  analysis 
(IPA)  of  the  differentially  expressed  genes.  Statistical  significance  of  the 


identified  overrepresented  canonical  pathways  is  indicated  by  the  nega¬ 
tive  log  of  the  P  values.  Functional  pathways  and  interaction  network 
analysis  by  IPA  depicting  the  most  statistically  significantly  (P  <  .001) 
modulated  pathway  (C)  and  the  gene  network  with  highest  number  of 
differentially  expressed  and  closely  related  (G-protein  coupled  receptors) 
interacting  genes  (D);  red  =  higher  experession;  green  =  lower  expres¬ 
sion.  Genes  selected  for  subsequent  confirmation  by  quantitative  real¬ 
time  polymerase  chain  reaction  are  highlighted  by  black  margins. 
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Figure  3.  Analysis  of  adjacent  airway  field  cancerization  (FC)  pro¬ 
files  in  large  airways  of  patients  with  suspected  lung  cancer.  Gene 
set  enrichment  analysis  (GSEA)  was  performed,  as  described  in  the 
Supplementary  Methods  (available  online),  to  identify  genes  that 
were  upregulated  and  downregulated  in  the  adjacent  airway  FC 
that  were  also  concordantly  enriched  between  genes  differentially 
expressed  between  histologically  normal  airways  of  smokers  with 
and  without  lung  cancer  (17).  Hierarchical  cluster  analysis  using 


the  299  leading  edge  genes  (n  =  59  upregulated  and  240  downregu¬ 
lated)  was  performed  side  by  side  in  the  adjacent  airway  FC  (A)  and 
in  airways  of  smokers  with  and  without  lung  cancer  (B).  Columns 
represent  samples,  and  rows  represent  gene  features  (red  =  upregu¬ 
lated;  blue  =  downregulated).  Pvalue  showing  statistical  significance 
of  separate  clustering  of  airways  of  smokers  with  lung  cancer  from 
those  of  healthy  smokers  was  obtained  by  the  two-sided  Fisher  exact 
test. 
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from  the  tumors)  (Supplementary  Taible  4  and  Supplementary 
Methods,  Sweave  Report  3,  available  online)  and  key  cancer- 
associated  signaling  pathways  that  were  differentially  expressed 
in  airways  with  respect  to  tumor  proximity  (Figure  4,  A  and  B). 
We  then  derived  a  quantitative  score  (Supplementary  Methods, 
available  online)  to  signify  the  extent  of  the  site-dependent  effect 
(Figure  4C).  It  is  worthwhile  to  mention  that  the  site-dependent 
effect  was  more  pronounced  in  lung  SCCs  than  in  adenocarcino¬ 
mas  (Supplementary  Figure  2,  A  and  B,  available  online). 

We  then  examined  whether  the  422  gene  features  were  modu¬ 
lated  concordantfy  between  NSCLCs  and  normal  lung  tissues.  We 
performed  one-sided  t  tests  of  the  gene  features  between  NSCLC 
and  normal  lung  tissues  to  identify  those  that  are  modulated  in  the 
same  direction  between  the  tumors  and  uninvolved  normal  lung 
tissues.  We  found  that  291  of  the  335  genes  that  were  increased 
and  53  of  the  87  that  were  decreased  in  airways  with  shorter  dis¬ 
tance  from  tumors  were  also  upregulated  and  downregulated, 
respectively,  in  NSCLCs  compared  with  normal  lung  tissues 
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(Supplementary  Taible  5  and  Supplementary  Methods,  Sweave 
Report  4,  available  online).  In  addition,  the  site- dependent  airway 
FC  score  was  statistically  significantly  and  concordantfy  modu¬ 
lated  between  NSCLCs  and  paired  uninvolved  normal  lung  tissues 
(Figure  4D).  These  findings  suggest  that  the  molecular  airway  FC 
in  NSCLC  is  in  part  localized  and  modulated  by  distance  from 
tumors  and  that  this  gradient  site-dependent  effect  in  the  airway 
recapitulates  NSCLC  expression  patterns. 

QRTPCR  Analysis  of  the  Differential  Expression  of 
Adjacent  Airway  FC  Markers 

We  performed  QRTPCR  analysis  of  the  expression  of  transforming 
growth  factor  beta  receptor  II  (TGFBR2)  and  vasoactive  intestinal  pep¬ 
tide  receptor  1  (VIPRl),  which  were  selected  based  on  pathway-based 
analysis  of  the  site-independent  adjacent  airway  FC  profile  (Figaire  2, 
C  and  D)  and  of  neuropilin  (NRP)  and  tolloid  (TLL)-like  2  (NET02) 
and  lysosomal  protein  transmembrane  4  beta  {LAPTM4B),  which  were 
among  the  FC  markers  (Supplementary  Tible  4,  available  online)  with 
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Figure  4.  Analysis  of  airway  expression  profiles  by  distance  from  cor¬ 
responding  non-small  cell  lung  cancers  (NSCLCs).  Ordinal  logistic 
regression  analysis  of  airways  was  performed,  as  described  in  the 
Supplementary  Methods  (available  online),  and  identified  422  gene  fea¬ 
tures  with  statistically  differential  expression  in  the  airway  with  respect 
to  tumor  proximity  (false  discovery  rate  <  5%).  A)  Clustering  analysis  was 
performed  as  described  in  the  Supplementary  Sweave  Reports  (available 
online),  and  airway  samples  were  arranged  by  difference  in  the  expres¬ 
sion  values  of  the  site-dependent  genes  between  the  upregulated  and 
downregulated  gene  clusters.  Columns  represent  samples,  and  rows 
represent  gene  features  (red  =  upregulated;  blue  =  downregulated).  B) 


Functional  pathways  analysis  by  ingenuity  pathways  analysis  of  the  site- 
dependent  differentially  expressed  genes.  Statistical  significance  of  the 
identified  overrepresented  canonical  pathways  is  indicated  by  the  nega¬ 
tive  log  of  the  P  values.  The  site-dependent  effect  in  the  adjacent  field 
cancerization  (FC)  was  quantified  as  described  in  the  Supplementary 
Methods  and  Sweave  Reports  (available  online).  Box  plots  depicting  site- 
dependent  FC  score  in  airways  (C)  and  between  corresponding  paired 
NSCLCs  and  normal  lung  tissue  after  statistical  analysis  by  one-sided  t 
tests  (D).  Heavy  lines  indicate  medians,  and  whiskers  indicate  maximum 
and  minimum  FC  scores.  Airway  distance  from  tumors  is  numerically 
indicated  with  a  range  of  1  (closest)  to  5  (farthest). 
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statistically  significantly  (P  <  .01)  increased  expression  in  airways  with 
shorter  distance  firom  tumors.  VIPRl  also  exhibited  statistically  signifi¬ 
cant  site-dependent  expression  modulation  in  the  FC  (Supplementary 
Table  4,  available  online).  QRTPCR  demonstrated  that  TGFBR2  and 
VIPRl  mRNA  levels  were  statistically  significantiy  decreased  and 
NET02  and  LAPTM4B  levels  increased  in  NSCLCs  and  airways  com¬ 
pared  with  normal  lung  tissue  {P  <  .001)  (FigTire  5,  A-D,  top  panels). 
QRTPCR  also  revealed  that  VIPRl  (P  =  .02)  but  not  TGFBR2  exhibited 
statistically  significantiy  decreased  expression  in  airways  with  respect  to 
tumor  proximity  (Figure  5,  A  and  B,  middle  panels),  whereas  NET02 
and  LAPTM4B  levels  were  statistically  significantly  increased  in  the  air¬ 
way  with  shorter  distance  from  tumors  (both  P  <  .05)  (Figvire  5,  C  and 
D,  middle  panels).  Microarray  and  QRTPCR-based  expression  of  the 
four  genes  were  statistically  correlated  (P  <  .001)  (Figure  5,  A-D,  lower 
panels).  These  findings  highlight  the  confirmed  differential  expression 
of  markers  in  the  adjacent  aii'way  FC. 

Effect  of  the  FC  Gene  LAPTM4B  on  Lung  Cancer 
Cell  Growth 

We  were  prompted  to  study  the  relevance  of  LAPTM4B,  a  lysosome- 
associated  transmembrane  putative  oncogene  (27)  with  no  known 


role  in  lung  carcinogenesis,  to  the  lung  malignant  phenotype.  We 
first  found  that  LAPTM4B  expression  was  statistically  significantly 
increased  in  BEAS-2B  immortalized  lung  epithelial  cells  compared 
with  normal  bronchial  cells  {P  <  .001)  (Supplementary  Figure  3,  avail¬ 
able  online)  and  in  NSCLC  cell  lines  (data  not  shown)  compared  with 
bronchial  epithelial  cells  after  in  silico  analysis  of  a  publicly  available 
dataset  of  human  NSCLC  and  bronchial  epithelial  cell  lines  [Gene 
Expression  Omnibus  dataset  GSE4824  (2  8)].  Transient  knockdown  of 
LAPTM4B  expression  in  immortalized  and  malignant  lung  epithelial 
cell  lines  effectively  and  statistically  significantly  reduced  LAPTM4B 
expression  (all  P  <  .001)  and  cell  growth  (all  P  <  .05)  (Supplementary 
Figures  4  and  5,  available  online).  Stable  knockdown  of  LAPTM4B 
in  Calu-6  lung  cancer  cells  statistically  significantly  suppressed 
LAPTM4B  expression  (relative  expression  compared  with  empty  vec¬ 
tor,  mean  ±  standard  deviation  [SD]:  shLAPTM4B  clone  1 : 0.2 1  ±  0.02; 
sh  T.APTM4B  clone  2:0.31±0.03;P<.001)  (Figure  6A)  concomitant 
with  statistically  significantly  reduced  cell  growth  (relative  cell  num¬ 
bers  X  10"^  compared  to  0  hours,  mean  ±  SD:  empty  vector:  5.83  ±0.38; 
scrambled  shRNA:  5.69 ±0.75;  shLAPTM4B  clone  1:  3.72  ±0.19, 
P  =  .01;  shLAPTM4B  clone  2:  3.67  ±0.60,  P  =  .02)  (Figure  6B)  and 
anchorage-dependent  (mean  colony  numbers  ±  SD:  empty  vector: 
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Figure  5.  Quantitative  real-time  polymerase  chain  reaction  (QRTPCR) 
analysis  of  airway  field  cancerization  (FC)  markers.  Expression  of 
TGFBR2  (A),  VIPRl  (B),  NET02  (C),  and  LAPTM4B  (D)  was  analyzed  by 
QRTPCR  in  18  of  20  NSCLC  FC  case  subjects  studied  with  sufficient  RNA 
from  airway  samples  left  over  after  expression  profiling.  Expression  of 
the  indicated  genes  is  depicted  by  group  (NSCLCs,  airways,  and  nor¬ 
mal  lung  tissues;  upper  panels)  and  across  airway  samples  based  on 
distance  from  corresponding  NSCLCs  (1  =  airway  closest  to  tumors; 
5  =  airways  relatively  farthest  from  tumors;  middle  panels).  Relative 
mRNA  expression  was  assessed  by  QRTPCR,  normalized  to  that  of 
TBP,  and  quantified  using  the  relative  quantification  method  as 
detailed  in  the  Supplementary  Methods  (available  online).  PCR  reac¬ 
tions  for  each  FC  sample  were  carried  out  in  duplicate.  Boxes  indicate 


±  standard  error  of  the  mean;  error  bars  indicate  standard  deviation. 
Statistical  significance  of  differences  in  expression  among  NSCLC, 
airway,  and  normal  lung  groups  was  assessed  by  the  Kruskal-Wallis 
test  (upper  panels),  and  statistical  significance  of  differences  among 
different  airways  was  assessed  by  analysis  of  variance  (middle  lanes). 
Correlation  between  expression  of  the  indicated  genes  quantified 
by  microarray  and  QRTPCR  analyses  was  statistically  assessed  by 
Spearman  rank  (lower  panels).  LAPTM4B  =  lysosomal  protein  trans¬ 
membrane  4  beta;  NET02  =  neuropilin  (NRP)  and  tolloid  (TLL)-like  2; 
TGFBR2  =  transforming  growth  factor  beta  receptor  II;  VIPRl  =  vaso¬ 
active  intestinal  peptide  receptor  1;TBP  =  TATA  box  binding  protein. 
*P  <  .05;  **  P  <  .001;  NS,  not  significant.  All  statistical  tests  were 
two-sided. 
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Figure  6.  Effect  of  RNA  interference-mediated  knockdown  of  LAPTM4B 
on  lung  cancer  cell  anchorage-dependent  and  -independent  growth 
and  colony  formation.  Calu-6  lung  cancer  cells  were  stably  transfected 
with  empty  vectors,  vectors  containing  scrambled  short  hairpin  RNA 
(shRNA),  as  well  as  vectors  with  LAPTM4B-sjpec\f\c  shRNA  sequences 
as  described  in  the  Methods  section.  A)  Quantitative  real-time  polymer¬ 
ase  chain  reaction  analysis  depicting  statistically  significantly  reduced 
LAPTM4B  relative  expression  in  two  Calu-6  sublines  {sh L A PTM 4 B-^  and 
s}r\LAPTM4B-2)  stably  transfected  with  two  different  LAPTM4B-s}pec\f\c 
shRNA  sequences  compared  with  Calu-6  cells  stably  transfected  with 
empty  and  scrambled  shRNA-containing  vectors.  B)The  indicated  stably 


31.0±9.17;  scrambled  shRNA:  30.3 ±2,89;  shLAPTM4B  clone  1: 
10.0 ±1.0,  P  <  .001;  shLAPTM4B  clone  2:  11.67 ±3.79,  P  =  .002) 
(Figure  6C)  and  -independent  colony  number  (mean  colony  num¬ 
bers  ±  SD:  empty  vector:  30.3 ±1.53;  scrambled  shRNA:  29.0±2,0; 
sb TAPTM4B  clone  1:  15.3  ±4.04,  P  =  .003;  shLAPTM4B  clone  2: 
17.0 ±2, 65,  P  =  .006)  (Figm'e  6D),  These  data  demonstrate  that 
LAPTM4B  is  a  positive  mediator  of  immortalized  and  malignant  lung 
epithelial  cell  growth. 

Discussion 

In  this  study  we  characterized  the  airway  FC  transcriptome  adja¬ 
cent  to  NSCLC.  We  identified  gene  features  that  were  statistically 


D 


transfected  Calu-6  cells  were  seeded  in  triplicates  cells/well)  in 

12-well  plates  for  72  hours,  after  which  the  number  of  cells  in  each  well 
was  computed  by  the  trypan  blue  exclusion  method.  Cells  were  seeded 
in  triplicates  at  a  seeding  density  of  250  cells  per  well  in  six-well  culture 
plates  or  150  cells  per  well  on  soft  agar  for  assessment  of  anchorage- 
dependent  (C)  and  -independent  (D)  colony  formation,  respectively.  Cell 
colonies  were  then  quantified  as  described  in  the  Methods  section.  All 
experiments  were  done  in  triplicate.  Representative  images  of  cell  colo¬ 
nies  are  depicted  in  the  lower  panels  and  were  obtained  with  a  phase- 
light  microscope.  Error  bars  indicate  standard  deviation.  *P<  .05;  **P< 
.001  by  the  two-sided  Student  ftest. 

significantly  and  concordantly  modulated  between  both  NSCLCs 
and  airways  compared  with  normal  lung  tissue,  a  subset  of  which 
was  indicative  of  lung  cancer  among  smokers.  Moreover,  we 
revealed  that  the  adjacent  airway  FC  exhibits  gradient  site-depend¬ 
ent  expression  patterns  with  respect  to  tumor  proximity,  which 
effectively  predicted  NSCLC  profiles,  pointing  to  their  possible 
roles  in  lung  cancer  pathogenesis.  In  addition,  LAPTM4B,  whose 
expression  was  increased  in  the  airway  with  shorter  distance  from 
the  tumor,  was  elevated  in  NSCLC  and  in  immortalized  lung  epi¬ 
thelial  cells  and  promoted  anchorage-dependent  and  -independent 
lung  cancer  cell  growth  in  vitro. 

Our  analyses  pointed  to  the  statistically  significant  differential 
expression  of  FC  markers — namely,  TGFBR2,  VIPRl,  NET02  and 
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LAPTM4B.  TGFBR2,  a  transmembrane  receptor  serine  threonine 
kinase  that  mediates  TGF-(3  signaling  (29),  was  previously  reported 
to  be  downregulated  in  invasive  adenocarcinomas  compared  with 
bronchioalveolar  carcinomas  (30),  and  loss  of  the  murine  coun¬ 
terpart  was  shown  to  mediate  progression  and  development  of 
invasive  adenocarcinomas  in  a  mouse  model  of  Kras-induced  lung 
cancer  (31).  VIPRl  mRNA  expression  was  shown  to  be  highest  in 
normal  lung  tissue  compared  with  various  human  normal  epithelial 
tissues  examined  and  to  peripheral  blood  leukocytes  (32).  NET02 
expression  was  reported  to  be  upregulated  in  proliferating  heman¬ 
giomas  (33)  and  associated  with  invasiveness  and  motility  of  cancer 
cells  (34).  Furthermore,  LAPTM4B  was  shown  to  mediate  prosur¬ 
vival  autophagy  and  chemoresistance  in  breast  tumor  cells  (35)  and 
was  found  to  be  upregulated  and  associated  with  poor  prognosis 
in  ovarian  and  hepatocellular  carcinomas  (36,37).  In  this  study  we 
showed  that  LAPTM4B,  whose  role  in  lung  cancer  was  previously 
unknown,  was  upregulated  in  NSCLCs  and  promoted  anchorage- 
dependent  and  -independent  lung  cancer  cell  growth.  Our  findings 
suggest  that  detailed  interrogation  of  the  airway  FC  may  be  a  use¬ 
ful  approach  to  highlight  potential  uncharacterized  mechanisms 
and  molecules  involved  in  lung  cancer  pathogenesis. 

Our  group  has  recently  portrayed  the  spatial  and  temporal 
molecular  field  of  injury  in  early-stage  NSCLC  patients  by  expres¬ 
sion  profiling  of  large  airways  after  definitive  surgery  (19).  It  is 
important  to  mention  that  in  our  previous  study,  normal  airway  epi- 
thelia  were  collected  by  endoscopic  bronchoscopy  brushings  within 
12  months  after  surgical  removal  of  the  tumor  and  when  NSCLC 
tumors  were  not  present  in  situ  at  time  of  collection  (19).  In  our 
current  study,  we  performed  expression  profiling  of  multiple  nor¬ 
mal-appearing  airways  at  various  distances  from  tumors  in  conjunc¬ 
tion  with  paired  NSCLCs  and  normal  lung  tissues  that  were  still 
in  situ  at  the  time  of  airway  epithelia  collection.  It  is  important  to 
note  that  although  the  identified  adjacent  airway  FC  profiles  were, 
in  part,  enriched  in  large  proximal  airways,  they  were  substantially 
dissimilar  in  the  normal  lung  tissue  samples  (Figures  2  and  3).  It 
is  reasonable  to  suggest  that  the  identified  field  effects  are  more 
readily  discerned  in  samples  (eg,  airways)  with  a  higher  fraction  of 
epithelial  cell  content  and  raises  the  notion  that  some  of  the  changes 
in  the  adjacent  airway  FC  may  be  related  to  cell  type.  However,  this 
notion  is  also  applicable  to  the  comparison  of  NSCLCs  with  normal 
lung  tissue,  an  analysis  that  is  commonly  performed.  In  addition  and 
through  analysis  of  normal  airways  alone,  we  identified  gradient  and 
localized  site-dependent  expression  profiles  within  the  adjacent  air¬ 
way  FC  that  predicted  NSCLC  profiles.  It  is  intriguing  to  presume 
that  gradient  site- dependent  airway  field  effects  may  be  associated 
with  the  development  of  NSCLC  tumors  in  a  particular  lobe  or  ana¬ 
tomical  region  of  the  lung.  However,  it  cannot  be  discerned  whether 
these  airway  FC  effects  are  a  cause  or  consequence  of  NSCLC 
development,  although  it  is  not  unlikely  that  they  are  involved  in 
lung  carcinogenesis,  as  embodied  by  the  effect  of  the  site-depend¬ 
ent  airway  FC  marker,  LAPTM4B,  on  lung  cancer  cell  growth.  This 
supposition  can  be  addressed  in  future  studies  by  assessment  of  the 
FC  in  lung  cancer  patients  before  and  after  surgery. 

Analysis  of  a  dataset  of  airways  from  smokers  with  suspected 
lung  cancer  (17)  revealed  that  a  subset  of  genes  in  the  adjacent  air¬ 
way  FC  profile  was  able  to  distinguish  lung  cancer  patients  among 
smokers  with  suspicion  of  the  disease.  It  is  important  to  mention. 


however,  that  the  airway  FC  markers  we  had  identified  to  be  modu¬ 
lated  with  respect  to  spatial  proximity  from  tumors  were  not  able  to 
statistically  significantly  identify  lung  cancer  among  smokers.  This 
observation  may  be  attributable  to  the  possible  strong  association 
of  the  gradient  site-dependent  profiles  with  the  compartments  that 
are  adjacent  or  local  to  the  tumor  (6,10).  It  is  plausible  to  surmise 
that  the  adjacent  airway  FC  harbors  molecular  cancerization  pro¬ 
files  that  are  clinically  relevant  to  both  the  detection  and  (chemo) 
prevention  of  lung  cancer  and  those  that  are  biologically  relevant 
to  understanding  the  early  pathogenesis  of  this  malignancy. 

It  is  worthwhile  to  mention  that  our  study  is  not  without  limi¬ 
tations.  Further  analysis  to  fully  characterize  how  the  adjacent 
airway  FC  varies  by  NSCLC  histology  (squamous  vs  nonsqua- 
mous)  and  smoking  status  (never-smoker  vs  smoker)  was  hindered 
by  the  limited  number  of  FC  case  patients.  Moreover,  although 
we  compared  and  contrasted  the  gradient  and  site-dependent 
airway  FC  profile  (n  =  422  gene  features)  that  we  had  derived 
from  NSCLC  case  patients,  among  different  subgroups  (eg,  by 
histology),  identification  of  site-dependent  profiles  unique  to  a 
histological  or  smoking  subtype  of  NSCLC  was  also  impeded  by 
the  small  number  of  FC  case  patients.  Additionally,  we  could  not 
connect,  at  the  present  time,  adjacent  FC  profiles  of  nonsmoker 
adenocarcinoma  patients  to  the  field  effect  in  the  large  airway 
because  of  the  paucity  of  such  airway  samples  from  nonsmoker 
lung  cancer  patients.  Furthermore,  it  is  reasonable  to  specu¬ 
late  that  RNA  sequencing  (38),  compared  with  gene  expression 
profiling  technology  that  we  used,  would  provide  a  more  thor¬ 
ough  characterization  (eg,  identification  of  novel  tissue-specific 
transcripts)  of  the  transcrip tomic  architecture  of  the  airway  FC. 
Nonetheless,  our  study  represents  the  first  attempt  to  character¬ 
ize  the  global  adjacent  airway  FC  in  NSCLC,  and  efforts  are 
underway  to  expand  this  working  model  into  different  subtypes 
of  lung  cancer,  including  never-smoker  adenocarcinomas,  and 
to  apply  more  advanced  technologies  and  platforms  (eg,  RNA 
sequencing)  for  studying  the  airway  FC. 

In  conclusion,  our  gene  expression  profiling  efforts  revealed 
that  the  adjacent  and  molecular  airway  FC  in  NSCLC  is  com¬ 
prised  of  markers  that  can  identify  lung  cancer  among  smokers 
as  well  as  gradient  and  localized  site-dependent  expression  pat¬ 
terns  that  recapitulate  NSCLC  profiles.  Our  findings  suggest 
that  profiling  of  the  airway  FC  in  conjunction  with  NSCLCs 
may  provide  additional  insights  into  the  biology  of  NSCLC 
and  the  development  of  molecular  tools  for  the  detection  of  the 
malignancy. 
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